@nicolay-r on Hugging Face: "📢 So far I noticed that 🧠 reasoning with llm 🤖 in English is tend to be…"

Post

2111

📢 So far I noticed that 🧠 reasoning with llm 🤖 in English is tend to be more accurate than in other languages.
However, besides the GoogleTrans and other open transparent translators, I could not find one that could be easy to use solutions to avoid:
1.🔴 Third-party framework installation
2.🔴 Text chunking
3.🔴 support of meta-annotation like spans / objects / etc.

💎 To cope problem of IR from non-english texts, I am happy to share the bulk-translate 0.25.0. 🎊

⭐ https://github.com/nicolay-r/bulk-translate

bulk-translate is a tiny Python 🐍 no-string framework that allows translate series of texts with the pre-annotated fixed-spans that are invariant for translator.

It supports 👨‍💻 API for quick data translation with (optionaly) annotated objects in texts (see figure below) in Python 🐍
I make it accessible as much as possible for RAG and / or LLM-powered app downstreams:
📘 https://github.com/nicolay-r/bulk-translate/wiki

All you have to do is to provide iterator of texts, where each text:
1. ✅ String object
2. ✅ List of strings and nested lists that represent spans (value + any ID data).

🤖 By default I provide a wrapper over googletrans which you can override with your own 🔥
https://github.com/nicolay-r/bulk-translate/blob/master/models/googletrans_310a.py

Join the conversation