Update README.md
Browse files
README.md
CHANGED
@@ -17,13 +17,88 @@ Abstract: *This work explores expanding the capabilities of large language model
|
|
17 |
|
18 |
<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/634e15aec1ce28f1de91c470/CwSCmyJizQderIYC8CaJ4.mp4"></video>
|
19 |
|
20 |
-
|
21 |
## Method
|
22 |
|
23 |
Overview of our method. LLaMA-Mesh unifies text and 3D mesh in a uniform format by representing the numerical values of vertex coordinates and face definitions of a 3D mesh as plain text. Our model is trained using text and 3D interleaved data in an end-to-end manner. Therefore, our model can generate both text and 3D meshes in a unified model.
|
24 |
|
25 |
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/634e15aec1ce28f1de91c470/0DzHXhoxonG5ZMeTysA6s.jpeg)
|
26 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
## BibTeX
|
28 |
|
29 |
```bibtex
|
|
|
17 |
|
18 |
<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/634e15aec1ce28f1de91c470/CwSCmyJizQderIYC8CaJ4.mp4"></video>
|
19 |
|
|
|
20 |
## Method
|
21 |
|
22 |
Overview of our method. LLaMA-Mesh unifies text and 3D mesh in a uniform format by representing the numerical values of vertex coordinates and face definitions of a 3D mesh as plain text. Our model is trained using text and 3D interleaved data in an end-to-end manner. Therefore, our model can generate both text and 3D meshes in a unified model.
|
23 |
|
24 |
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/634e15aec1ce28f1de91c470/0DzHXhoxonG5ZMeTysA6s.jpeg)
|
25 |
|
26 |
+
### Model Developer: Base model weight is from Meta. Finetuned by Nvidia
|
27 |
+
|
28 |
+
## Third-Party Community Consideration:
|
29 |
+
|
30 |
+
This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party’s requirements for this application and use case; see link to Non-NVIDIA [Llama 3.1 Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md).
|
31 |
+
|
32 |
+
### License/Terms of Use:
|
33 |
+
This model is ready for non-commercial use.
|
34 |
+
This model is distributed under the NSCLv1 license and was built using Llama 3.1.
|
35 |
+
Attribution: "Built with Llama" as per the Llama 3.1 Community License Agreement. Please refer to Llama 3.1 licensing terms for further details.
|
36 |
+
|
37 |
+
## Reference(s):
|
38 |
+
Llama 3.1 [Github](https://github.com/meta-llama/llama-models/tree/main/models/llama3_1)
|
39 |
+
|
40 |
+
## Model Architecture:
|
41 |
+
**Architecture Type:** Transformer
|
42 |
+
*Network Architecture:* Llama 3.1
|
43 |
+
|
44 |
+
## Input:
|
45 |
+
**Input Type(s):** Text
|
46 |
+
|
47 |
+
**Input Format(s):** String
|
48 |
+
|
49 |
+
**Input Parameters:** 1D
|
50 |
+
|
51 |
+
**Other Properties Related to Input:** Max token length 8k
|
52 |
+
|
53 |
+
## Output:
|
54 |
+
**Output Type(s):** Text
|
55 |
+
|
56 |
+
**Output Format:** String
|
57 |
+
|
58 |
+
**Output Parameters:** 1D
|
59 |
+
|
60 |
+
**Other Properties Related to Output:** Max token length 8k
|
61 |
+
|
62 |
+
**Supported Hardware Microarchitecture Compatibility:**
|
63 |
+
* NVIDIA Ada
|
64 |
+
|
65 |
+
**Supported Operating System(s):**
|
66 |
+
* Linux
|
67 |
+
|
68 |
+
## Model Version(s):
|
69 |
+
Llama 3.1 8B mesh
|
70 |
+
|
71 |
+
# Training Dataset:
|
72 |
+
Please refer to [Llama 3.1 Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md) for information on Training, Testing, and Evaluation Datasets).
|
73 |
+
|
74 |
+
The data is curated through converting Objaverse mesh data into text string (in the format as vertex index, face index as string). The model is finetuned on the curated dataset with 32 GPU.
|
75 |
+
|
76 |
+
[**Objaverse**](https://objaverse.allenai.org/explore/)
|
77 |
+
|
78 |
+
**Data Collection Method by dataset**: Unknown
|
79 |
+
|
80 |
+
**Labeling Method by dataset**: Unknown
|
81 |
+
|
82 |
+
**Properties:** We use 30k mesh data, which is a subset from the Objaverse. We filter the Objaverse dataset by the number of faces, and only keep the shape with the number of faces less than 500. They are saved as obj file format.
|
83 |
+
|
84 |
+
**Dataset License(s):** The use of the dataset as a whole is licensed under the ODC-By v1.0 license.
|
85 |
+
|
86 |
+
[**UltraChat**](https://github.com/thunlp/UltraChat)
|
87 |
+
|
88 |
+
**Data Collection Method by dataset**: Unknown
|
89 |
+
|
90 |
+
**Labeling Method by dataset**: Unknown
|
91 |
+
|
92 |
+
## Inference:
|
93 |
+
**Engine**: Pytorch
|
94 |
+
|
95 |
+
**Test Hardware**: A100
|
96 |
+
|
97 |
+
## Ethical Considerations:
|
98 |
+
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
|
99 |
+
|
100 |
+
Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
|
101 |
+
|
102 |
## BibTeX
|
103 |
|
104 |
```bibtex
|