Text-To-Video Generation Based On Diffusion Model