Language Detection
Collection
StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText
•
2 items
•
Updated
•
3
This model is an export of this FastText Language Identification model for staticvectors
. staticvectors
enables running inference in Python with NumPy. This helps it maintain solid runtime performance.
Language detection is an important task and identification with n-gram models is an efficient and highly accurate way to do it.
This model is a quantized version of the base language id model. It's using 2x256 Product Quantization like the original quantized model from FastText. This shrinks this model down to 4MB with only a minor hit on accuracy.
from staticvectors import StaticVectors
model = StaticVectors("neuml/language-id-quantized")
model.predict(["What language is this text?"])
Base model
NeuML/language-id