THUDM
/

CogVideoX1.5-5B-SAT

Image-to-Video

Safetensors

English

Model card Files Files and versions Community

zR commited on 9 days ago

Commit

3035108

•

1 Parent(s): 56dad16

init

Browse files

Files changed (3) hide show

LICENSE +71 -0
README.md +91 -0
README_zh.md +81 -0

LICENSE ADDED Viewed

	@@ -0,0 +1,71 @@

+The CogVideoX License
+1. Definitions
+“Licensor” means the CogVideoX Model Team that distributes its Software.
+“Software” means the CogVideoX model parameters made available under this license.
+2. License Grant
+Under the terms and conditions of this license, the licensor hereby grants you a non-exclusive, worldwide, non-transferable, non-sublicensable, revocable, royalty-free copyright license. The intellectual property rights of the generated content belong to the user to the extent permitted by applicable local laws.
+This license allows you to freely use all open-source models in this repository for academic research. Users who wish to use the models for commercial purposes must register and obtain a basic commercial license in https://open.bigmodel.cn/mla/form .
+Users who have registered and obtained the basic commercial license can use the models for commercial activities for free, but must comply with all terms and conditions of this license. Additionally, the number of service users (visits) for your commercial activities must not exceed 1 million visits per month.
+If the number of service users (visits) for your commercial activities exceeds 1 million visits per month, you need to contact our business team to obtain more commercial licenses.
+The above copyright statement and this license statement should be included in all copies or significant portions of this software.
+3. Restriction
+You will not use, copy, modify, merge, publish, distribute, reproduce, or create derivative works of the Software, in whole or in part, for any military, or illegal purposes.
+You will not use the Software for any act that may undermine China's national security and national unity, harm the public interest of society, or infringe upon the rights and interests of human beings.
+4. Disclaimer
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
+5. Limitation of Liability
+EXCEPT TO THE EXTENT PROHIBITED BY APPLICABLE LAW, IN NO EVENT AND UNDER NO LEGAL THEORY, WHETHER BASED IN TORT, NEGLIGENCE, CONTRACT, LIABILITY, OR OTHERWISE WILL ANY LICENSOR BE LIABLE TO YOU FOR ANY DIRECT, INDIRECT, SPECIAL, INCIDENTAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES, OR ANY OTHER COMMERCIAL LOSSES, EVEN IF THE LICENSOR HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
+6. Dispute Resolution
+This license shall be governed and construed in accordance with the laws of People’s Republic of China. Any dispute arising from or in connection with this License shall be submitted to Haidian District People's Court in Beijing.
+Note that the license is subject to update to a more comprehensive version.  For any questions related to the license and copyright, please contact us at [email protected].
+1. 定义
+“许可方”是指分发其软件的 CogVideoX 模型团队。
+“软件”是指根据本许可提供的 CogVideoX 模型参数。
+2. 许可授予
+根据本许可的条款和条件，许可方特此授予您非排他性、全球性、不可转让、不可再许可、可撤销、免版税的版权许可。生成内容的知识产权所属，可根据适用当地法律的规定，在法律允许的范围内由用户享有生成内容的知识产权或其他权利。
+本许可允许您免费使用本仓库中的所有开源模型进行学术研究。对于希望将模型用于商业目的的用户，需在 https://open.bigmodel.cn/mla/form 完成登记并获得基础商用授权。
+经过登记并获得基础商用授权的用户可以免费使用本模型进行商业活动，但必须遵守本许可的所有条款和条件。
+在本许可证下，您的商业活动的服务用户数量（访问量）不得超过100万人次访问 / 每月。如果超过，您需要与我们的商业团队联系以获得更多的商业许可。
+上述版权声明和本许可声明应包含在本软件的所有副本或重要部分中。
+3.限制
+您不得出于任何军事或非法目的使用、复制、修改、合并、发布、分发、复制或创建本软件的全部或部分衍生作品。
+您不得利用本软件从事任何危害国家安全和国家统一、危害社会公共利益、侵犯人身权益的行为。
+4.免责声明
+本软件“按原样”提供，不提供任何明示或暗示的保证，包括但不限于对适销性、特定用途的适用性和非侵权性的保证。
+在任何情况下，作者或版权持有人均不对任何索赔、损害或其他责任负责，无论是在合同诉讼、侵权行为还是其他方面，由软件或软件的使用或其他交易引起、由软件引起或与之相关 软件。
+5. 责任限制
+除适用��律禁止的范围外，在任何情况下且根据任何法律理论，无论是基于侵权行为、疏忽、合同、责任或其他原因，任何许可方均不对您承担任何直接、间接、特殊、偶然、示范性、 或间接损害，或任何其他商业损失，即使许可人已被告知此类损害的可能性。
+6.争议解决
+本许可受中华人民共和国法律管辖并按其解释。 因本许可引起的或与本许可有关的任何争议应提交北京市海淀区人民法院。
+请注意，许可证可能会更新到更全面的版本。 有关许可和版权的任何问题，请通过 [email protected] 与我们联系。

README.md ADDED Viewed

	@@ -0,0 +1,91 @@

+---
+license: other
+language:
+- en
+base_model:
+- THUDM/CogVideoX-5b
+- THUDM/CogVideoX-5b-I2V
+pipeline_tag: image-to-image
+---
+# CogVideoX1.1-5B-SAT
+<p style="text-align: center;">
+  <div align="center">
+  <img src=https://modelscope.oss-cn-beijing.aliyuncs.com/resource/cogvideologo.svg width="50%"/>
+  </div>
+  <p align="center">
+  <a href="README_zh.md">📄 中文阅读</a> |
+  <a href="https://github.com/THUDM/CogVideo">🌐 Github </a> |
+  <a href="https://arxiv.org/pdf/2408.06072">📜 arxiv </a>
+</p>
+<p align="center">
+📍 Visit <a href="https://chatglm.cn/video?lang=en?fr=osm_cogvideo">QingYing</a> and <a href="https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9">API Platform</a> to experience commercial video generation models.
+</p>
+CogVideoX is an open-source video generation model originating from [Qingying](https://chatglm.cn/video?fr=osm_cogvideo). CogVideoX1.1 is the upgraded version of the open-source CogVideoX model.
+The CogVideoX1.1-5B series model supports **10-second** videos and higher resolutions. The `CogVideoX1.1-5B-I2V` variant supports **any resolution** for video generation.
+This repository contains the SAT-weight version of the CogVideoX1.1-5B model, specifically including the following modules:
+## Transformer
+Includes weights for both I2V and T2V models. Specifically, it includes the following modules:
+```
+├── transformer_i2v
+│   ├── 1000
+│   │   └── mp_rank_00_model_states.pt
+│   └── latest
+└── transformer_t2v
+    ├── 1000
+    │   └── mp_rank_00_model_states.pt
+    └── latest
+```
+Please select the corresponding weights when performing inference.
+## VAE
+The VAE part is consistent with the CogVideoX-5B series and does not require updating. You can also download it directly from here. Specifically, it includes the following modules:
+```
+└── vae
+    └── 3d-vae.pt
+```
+## Text Encoder
+Consistent with the diffusers version of CogVideoX-5B, no updates are necessary. You can also download it directly from here. Specifically, it includes the following modules:
+```
+├── t5-v1_1-xxl
+   ├── added_tokens.json
+   ├── config.json
+   ├── model-00001-of-00002.safetensors
+   ├── model-00002-of-00002.safetensors
+   ├── model.safetensors.index.json
+   ├── special_tokens_map.json
+   ├── spiece.model
+   └── tokenizer_config.json
+0 directories, 8 files
+```
+## Model License
+This model is released under the [CogVideoX LICENSE](LICENSE).
+## Citation
+```
+@article{yang2024cogvideox,
+  title={CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer},
+  author={Yang, Zhuoyi and Teng, Jiayan and Zheng, Wendi and Ding, Ming and Huang, Shiyu and Xu, Jiazheng and Yang, Yuanming and Hong, Wenyi and Zhang, Xiaohan and Feng, Guanyu and others},
+  journal={arXiv preprint arXiv:2408.06072},
+  year={2024}
+}
+```

README_zh.md ADDED Viewed

	@@ -0,0 +1,81 @@

+# CogVideoX1.1-5B-SAT
+<p style="text-align: center;">
+  <div align="center">
+  <img src=https://github.com/THUDM/CogVideo/raw/main/resources/logo.svg width="50%"/>
+  </div>
+  <p align="center">
+  <a href="README_en.md">📄 Read in English</a> |
+  <a href="https://github.com/THUDM/CogVideo">🌐 Github </a> |
+  <a href="https://arxiv.org/pdf/2408.06072">📜 arxiv </a>
+</p>
+<p align="center">
+📍 前往<a href="https://chatglm.cn/video?fr=osm_cogvideox"> 清影</a> 和 <a href="https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9"> API平台</a> 体验商业版视频生成模型
+</p>
+CogVideoX是 [清影](https://chatglm.cn/video?fr=osm_cogvideo) 同源的开源版本视频生成模型。CogVideoX1.1 是 CogVideoX 开源模型的升级版本。
+CogVideoX1.1-5B 系列模型支持 **10秒** 长度的视频和更高的分辨率，其中 `CogVideoX1.1-5B-I2V` 支持 **任意分辨率** 的视频生成。
+本仓库存放了 CogVideoX1.1-5B 模型的 SAT 权重版本。具体来说，包含了如下模块:
+## Transformer
+包含了I2V和T2V两个模型的权重。具体来说，包含了如下模块:
+```
+├── transformer_i2v
+│   ├── 1000
+│   │   └── mp_rank_00_model_states.pt
+│   └── latest
+└── transformer_t2v
+    ├── 1000
+    │   └── mp_rank_00_model_states.pt
+    └── latest
+```
+请在推理的时候选择对应的权重进行推理。
+## VAE
+VAE部分与 CogVideoX-5B 系列一致，无需更新。你也可以直接从这里下载。具体来说，包含了如下模块:
+```
+└── vae
+    └── 3d-vae.pt
+```
+## Text Encoder
+与 diffusers 版本的 CogVideoX-5B 一致，无需更新。
+你也可以直接从这里下载。具体来说，包含了如下模块:
+```
+├── t5-v1_1-xxl
+   ├── added_tokens.json
+   ├── config.json
+   ├── model-00001-of-00002.safetensors
+   ├── model-00002-of-00002.safetensors
+   ├── model.safetensors.index.json
+   ├── special_tokens_map.json
+   ├── spiece.model
+   └── tokenizer_config.json
+0 directories, 8 files
+```
+## 模型协议
+该模型根据 [CogVideoX LICENSE](LICENSE) 许可证发布。
+## 引用
+```
+@article{yang2024cogvideox,
+  title={CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer},
+  author={Yang, Zhuoyi and Teng, Jiayan and Zheng, Wendi and Ding, Ming and Huang, Shiyu and Xu, Jiazheng and Yang, Yuanming and Hong, Wenyi and Zhang, Xiaohan and Feng, Guanyu and others},
+  journal={arXiv preprint arXiv:2408.06072},
+  year={2024}
+}
+```