scfengv
/

TVL_GameLayerClassifier

Text Classification

Safetensors

Chinese

bert

multi-label

Eval Results

Model card Files Files and versions Community

scfengv commited on Oct 16, 2024

Commit

b900f3a

1 Parent(s): fd3ed5b

Update README

Browse files

Files changed (1) hide show

README.md +73 -57

README.md CHANGED Viewed

@@ -33,31 +33,86 @@ model-index:
             type: F1 score (Macro)
             value: 0.993694
 ---
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-## Model Details
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [scfengv](https://huggingface.co/scfengv)
-- **Model type:** BERT Multi-label Text Classification
-- **Language:** Chinese (Zh)
-- **Finetuned from model:** [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese)
-### Model Sources
-<!-- Provide the basic links for the model. -->
-- **Repository:** [scfengv/NLP-Topic-Modeling-for-TVL-livestream-comments](https://github.com/scfengv/NLP-Topic-Modeling-for-TVL-livestream-comments)
-## How to Get Started with the Model
-Use the code below to get started with the model.
 ```python
 import torch
@@ -79,47 +134,8 @@ with torch.no_grad():
 print(predictions)
 ```
-## Training Details
-- **Hardware Type:** NVIDIA Quadro RTX8000
-- **Library:** PyTorch
-- **Hours used:** 2hr 13mins
-### Training Data
-- [scfengv/TVL-game-layer-dataset](https://huggingface.co/datasets/scfengv/TVL-game-layer-dataset)
-  - train
-### Training Hyperparameters
-The model was trained using the following hyperparameters:
-```
-Learning rate: 1e-05
-Batch size: 32
-Number of epochs: 10
-Optimizer: Adam
-Loss function: torch.nn.BCEWithLogitsLoss()
-```
-## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-- [scfengv/TVL-game-layer-dataset](https://huggingface.co/datasets/scfengv/TVL-game-layer-dataset)
-  - validation
-  - Remove Emoji
-  - Emoji2Desc
-  - Remove Punctuation
-### Results (validation)
-- Accuracy score: 0.985764
-- F1 score (Micro): 0.993132
-- F1 Score (Macro): 0.993694

             type: F1 score (Macro)
             value: 0.993694
 ---
+# Model Details of TVL_GameLayerClassifier
+## Base Model
+This model is fine-tuned from [google-bert/bert-base-chinese](https://huggingface.co/google-bert/bert-base-chinese).
+## Model Architecture
+- **Type**: BERT-based text classification model
+- **Hidden Size**: 768
+- **Number of Layers**: 12
+- **Number of Attention Heads**: 12
+- **Intermediate Size**: 3072
+- **Max Sequence Length**: 512
+- **Vocabulary Size**: 21,128
+## Key Components
+1. **Embeddings**
+   - Word Embeddings
+   - Position Embeddings
+   - Token Type Embeddings
+   - Layer Normalization
+2. **Encoder**
+   - 12 layers of:
+     - Self-Attention Mechanism
+     - Intermediate Dense Layer
+     - Output Dense Layer
+     - Layer Normalization
+3. **Pooler**
+   - Dense layer for sentence representation
+4. **Classifier**
+   - Output layer with 5 classes
+## Training Hyperparameters
+The model was trained using the following hyperparameters:
+```
+Learning rate: 1e-05
+Batch size: 32
+Number of epochs: 10
+Optimizer: Adam
+Loss function: torch.nn.BCEWithLogitsLoss()
+```
+## Training Infrastructure
+- **Hardware Type:** NVIDIA Quadro RTX8000
+- **Library:** PyTorch
+- **Hours used:** 2hr 13mins
+## Model Parameters
+- Total parameters: ~102M (estimated)
+- All parameters are in 32-bit floating point (F32) format
+## Input Processing
+- Uses BERT tokenization
+- Supports sequences up to 512 tokens
+## Output
+- 5-class multi-label classification
+## Performance Metrics
+- Accuracy score: 0.985764
+- F1 score (Micro): 0.993132
+- F1 score (Macro): 0.993694
+## Training Dataset
+This model was trained on the [scfengv/TVL-game-layer-dataset](https://huggingface.co/datasets/scfengv/TVL-game-layer-dataset).
+## Testing Dataset
+- [scfengv/TVL-game-layer-dataset](https://huggingface.co/datasets/scfengv/TVL-game-layer-dataset)
+  - validation
+  - Remove Emoji
+  - Emoji2Desc
+  - Remove Punctuation
+## Usage
 ```python
 import torch
 print(predictions)
 ```
+## Additional Notes
+- This model is specifically designed for TVL Game layer classification tasks.
+- It's based on the Chinese BERT model, indicating it's optimized for Chinese text.
+For more detailed information about the model architecture or usage, please refer to the BERT documentation and the specific fine-tuning process used for this classifier.