ligeng-dev/tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume Text Generation • 8B • Updated 24 days ago • 173
ligeng-dev/tw-data-train_classified-8node-resume Text Generation • 8B • Updated 24 days ago • 971
ligeng-dev/tw-data-train_final_replaced_from_classified-fix-format-8node-resume Text Generation • 8B • Updated 24 days ago • 911
ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix Text Generation • 8B • Updated 26 days ago • 909
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper • 2408.10188 • Published Aug 19, 2024 • 52