|
--- |
|
library_name: transformers |
|
tags: |
|
- intent-classification
|
- text-classification |
|
license: apache-2.0 |
|
language: |
|
- en |
|
base_model: |
|
- google-bert/bert-base-uncased |
|
pipeline_tag: text-classification |
|
--- |
|
|
|
# Model Card for bert-uncased-intent-classification
|
|
|
This is a fine-tuned BERT model for intent classification that categorizes user utterances into 82 distinct intent labels. It was trained on a consolidated collection of several multilingual intent datasets.
|
|
|
|
|
## How to Get Started with the Model |
|
|
|
Use the code below to get started with the model. |
|
|
|
```python |
|
from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline |
|
|
|
model = AutoModelForSequenceClassification.from_pretrained("yeniguno/bert-uncased-intent-classification") |
|
tokenizer = AutoTokenizer.from_pretrained("yeniguno/bert-uncased-intent-classification") |
|
|
|
pipe = pipeline("text-classification", model=model, tokenizer=tokenizer) |
|
|
|
text = "Play the song, Sam." |
|
prediction = pipe(text) |
|
|
|
print(prediction) |
|
|
|
# [{'label': 'play_music', 'score': 0.9997674822807312}] |
|
``` |
|
|
|
## Uses |
|
|
|
This model is intended for: |
|
|
|
Natural Language Understanding (NLU) tasks, in particular classifying user intents for applications such as:
|
|
|
- Voice assistants |
|
- Chatbots |
|
- Customer support automation |
|
- Conversational AI systems |
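As a sketch of how the classifier might slot into one of these applications, the hypothetical dispatcher below routes a pipeline prediction to a handler function. The `route_intent` helper, the handler names, and the confidence threshold are illustrative assumptions, not part of the model or this card; the prediction shown is mocked in the same format the pipeline returns.

```python
# Hypothetical intent router: maps the top predicted label to a handler.
# Handler names and the 0.5 threshold are illustrative assumptions.

def route_intent(prediction, handlers, threshold=0.5):
    """Dispatch the top prediction from a text-classification pipeline.

    `prediction` is pipeline output, e.g. [{'label': 'play_music', 'score': 0.99}].
    Falls back to the 'fallback' handler when the score is below `threshold`
    or the label has no registered handler.
    """
    top = prediction[0]
    handler = handlers.get(top["label"]) if top["score"] >= threshold else None
    return (handler or handlers["fallback"])(top)

handlers = {
    "play_music": lambda p: f"Playing music ({p['score']:.2f})",
    "fallback": lambda p: "Sorry, I didn't understand that.",
}

# Example with a mocked pipeline output:
print(route_intent([{"label": "play_music", "score": 0.9}], handlers))
```

In a real system the mocked list would be replaced by `pipe(user_text)` from the example above.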
|
|
|
## Bias, Risks, and Limitations |
|
- Performance may degrade on intents that are underrepresented in the training data.

- The model is not optimized for languages other than English.

- Domain-specific intents not included in the dataset may require additional fine-tuning.
|
|
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
This model was trained on a combination of intent datasets from various sources:
|
|
|
Datasets Used: |
|
|
|
- mteb/amazon_massive_intent |
|
- mteb/mtop_intent |
|
- sonos-nlu-benchmark/snips_built_in_intents |
|
- Mozilla/smart_intent_dataset |
|
- Bhuvaneshwari/intent_classification |
|
- clinc/clinc_oos |
|
|
|
Each dataset was preprocessed, and intent labels were consolidated into 82 unique classes. |
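The consolidation procedure itself is not published with this card; the snippet below is a minimal sketch of one way to merge source-specific labels into a unified set, assuming a hand-written mapping table. The mapping entries, field names, and the `consolidate` helper are all hypothetical.

```python
# Hypothetical sketch of consolidating intent labels from several source
# datasets into one unified label set. The mapping entries are illustrative;
# the actual mapping used for this model is not published.

LABEL_MAP = {
    ("amazon_massive_intent", "play_music"): "play_music",
    ("mtop_intent", "PLAY_MUSIC"): "play_music",
    ("snips_built_in_intents", "PlayMusic"): "play_music",
    ("clinc_oos", "weather"): "get_weather",
}

def consolidate(example):
    """Map a (source, label) pair to the unified label, or drop it (None)."""
    unified = LABEL_MAP.get((example["source"], example["label"]))
    return {**example, "label": unified} if unified else None

rows = [
    {"source": "mtop_intent", "label": "PLAY_MUSIC", "text": "put on some jazz"},
    {"source": "clinc_oos", "label": "weather", "text": "is it raining"},
    {"source": "clinc_oos", "label": "unmapped", "text": "this row is dropped"},
]
consolidated = [r for r in (consolidate(row) for row in rows) if r]
unified_labels = sorted({r["label"] for r in consolidated})
print(unified_labels)
```

Applied across all six sources, a mapping like this would yield the 82 unique classes described above.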
|
|
|
Dataset Sizes: |
|
|
|
- Train size: 138,228

- Validation size: 17,279

- Test size: 17,278
|
|
|
### Training Procedure |
|
|
|
The model was fine-tuned with the following hyperparameters: |
|
|
|
- Base Model: bert-base-uncased

- Learning Rate: 3e-5

- Batch Size: 32

- Epochs: 4

- Weight Decay: 0.01

- Evaluation Strategy: per epoch

- Precision: FP32 (no mixed precision)

- Hardware: A100 GPU
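A configuration sketch of these hyperparameters using the `transformers` `TrainingArguments` API might look like the following. This is an assumption about how the run could be wired up, not the author's actual training script; dataset loading, tokenization, and the `Trainer` itself are omitted.

```python
# Configuration sketch only: mirrors the hyperparameters listed above.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="bert-uncased-intent-classification",
    learning_rate=3e-5,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    num_train_epochs=4,
    weight_decay=0.01,
    eval_strategy="epoch",  # named "evaluation_strategy" in older transformers versions
    fp16=False,             # training was run in full FP32
)
```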
|
|
|
|
|
## Evaluation |
|
|
|
### Results |
|
|
|
#### Training and Validation: |
|
| Epoch | Training Loss | Validation Loss | Accuracy | F1 Score | Precision | Recall | |
|
|-------|---------------|-----------------|----------|----------|-----------|--------| |
|
| 1 | 0.1143 | 0.1014 | 97.38% | 97.33% | 97.36% | 97.38% | |
|
| 2 | 0.0638 | 0.0833 | 97.78% | 97.79% | 97.83% | 97.78% | |
|
| 3 | 0.0391 | 0.0946 | 97.98% | 97.98% | 97.99% | 97.98% | |
|
| 4 | 0.0122 | 0.1013 | 98.04% | 98.04% | 98.05% | 98.04% | |
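In the tables above, accuracy, F1, precision, and recall track each other closely, which is what support-weighted averaging over many classes tends to produce (weighted recall equals accuracy by construction). Assuming weighted averaging was used here (the card does not state it explicitly), a minimal pure-Python sketch of the computation:

```python
# Minimal sketch of support-weighted F1 over class labels. Assumes (not
# confirmed by the card) that the reported scores are weighted averages.
from collections import Counter

def weighted_f1(y_true, y_pred):
    support = Counter(y_true)
    total = len(y_true)
    score = 0.0
    for label, count in support.items():
        tp = sum(t == p == label for t, p in zip(y_true, y_pred))
        pred_count = sum(p == label for p in y_pred)
        precision = tp / pred_count if pred_count else 0.0
        recall = tp / count
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        score += (count / total) * f1
    return score

# Toy example with three classes:
print(round(weighted_f1([0, 0, 1, 2], [0, 1, 1, 2]), 4))  # → 0.75
```

The same averaging, run over the 17,278-example test split, would yield the figures reported below.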
|
|
|
#### Test Results: |
|
| Metric | Value | |
|
|-------------|----------| |
|
| **Loss** | 0.0814 | |
|
| **Accuracy**| 98.37% | |
|
| **F1 Score**| 98.37% | |
|
| **Precision**| 98.38% | |
|
| **Recall** | 98.37% | |
|
|