T Nips.mp4 -
: A method presented at NeurIPS 2024 that uses deep text-to-image/video diffusion models to control the appearance and structure of generated media.
: A framework for enhancing fine-grained temporal understanding in video Large Language Models (LLMs), appearing in NeurIPS 2024 proceedings . T nips.mp4
If you are looking for a specific video demo of these "deep text" technologies, researchers often upload their conference presentations to the NeurIPS Virtual site as .mp4 files. : A method presented at NeurIPS 2024 that

