AI Lab - Sofia University

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

melaniab updated a dataset 20 days ago

sofia-uni/toxic-data-bg

melaniab updated a model about 2 months ago

sofia-uni/toxic-bert-bg

melaniab updated a dataset 2 months ago

sofia-uni/toxic-data-bg

View all activity

sofia-uni's activity

melaniab

updated a dataset 20 days ago

sofia-uni/toxic-data-bg

Updated 20 days ago • 17

melaniab

updated a model about 2 months ago

sofia-uni/toxic-bert-bg

Text Classification • Updated Feb 26 • 1

melaniab

updated a dataset 2 months ago

sofia-uni/toxic-onto-bg

Updated Feb 10 • 14

s-emanuilov

posted an update 2 months ago

Post

5260

Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth

I wanted to share my experiment with training reasoning models in languages other than English/Chinese.

Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.

Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/

The model itself: s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

I hope this helps anyone looking to build reasoning models in their language.

4 replies

s-emanuilov

authored 2 papers 3 months ago

Billion-scale Similarity Search Using a Hybrid Indexing Approach with Advanced Filtering

Paper • 2501.13442 • Published Jan 23

A quantitative framework for evaluating architectural patterns in ML systems

Paper • 2501.11543 • Published Jan 20

s-emanuilov

posted an update 3 months ago

Post

581

A new benchmark (DPAB-α) has been released that evaluates LLM function calling in both Pythonic and JSON approaches.

It shows that Pythonic function calling often outperforms traditional JSON-based methods, especially for complex multi-step tasks.

Key findings from benchmarks:
— Claude 3.5 Sonnet leads with 87% on Pythonic vs 45% on JSON
— Smaller models show impressive results (Dria-Agent-α-3B: 72% Pythonic)
— Even larger models like DeepSeek V3 (685B) show significant gaps (63% Pythonic vs 33% JSON)

If you're building or using LLM agents, these results suggest that how you implement function calling could impact performance - might be worth reconsidering JSON-only approaches.

The benchmark: https://github.com/firstbatchxyz/function-calling-eval
Blog post: https://huggingface.co/blog/andthattoo/dpab-a

s-emanuilov

posted an update 3 months ago

Post

558

New paper from Salesforce AI Research. The authors found that joint training, continual pre-training (CPT), and instruction tuning with a 50/50 data split achieve better results than sequential training. Their 8B parameter model outperformed larger 70B models on financial tasks.

Down-sampling CPT data to match IT data size improved performance on CFA Challenge exams from 34.44% to 55.56%, while maintaining strong general knowledge capabilities as shown by comparable or better performance on general knowledge benchmarks like AI2-ARC and MMLU.

Technical implementation involved two-stage training: Group 1 utilized 3.84B tokens from web and basic texts, followed by Group 2, which used 1.66B tokens from domain-specific books. Their preference alignment method used generative reward models to identify and correct reasoning errors rather than just rating full solutions.

Evaluation on 91,872 samples across 31 tasks showed their Llama-Fin model achieving 91.13% accuracy on sentiment analysis (FPB) and 95.32% on FiQA SA, exceeding GPT-4's performance of 82.16% and 68.51%, respectively, on these benchmarks.

It could be useful for many financial companies looking to build AI pipelines.

Interesting read, but neither the model nor GitHub repo is accessible yet. The key insight for AI builders is that with small models - it is fully possible to outperform much bigger models.

https://arxiv.org/abs/2501.04961

s-emanuilov

posted an update 4 months ago

Post

2579

Hey HF community! 👋

Excited to share Monkt - a tool I built to solve the eternal headache of processing documents for ML/AI pipelines.

What it does: Converts PDFs, Word, PowerPoint, Excel, Web pages or raw HTML into clean Markdown or structured JSON.

Great for:
✔ LLM training dataset preparation;
✔ Knowledge base construction;
✔ Research paper processing;
✔ Technical documentation management.

It has API access for integration into ML pipelines.

Check it out at https://monkt.com/ if you want to save time on document processing infrastructure.

Looking forward to your feedback!

3 replies

melaniab

updated a Space 7 months ago

README

💻

s-emanuilov

authored a paper about 1 year ago

Architectural Approaches to Overcome Challenges in the Development of Data-Intensive Systems

Paper • 2312.03049 • Published Dec 5, 2023 • 2

AI & ML interests

Recent Activity

Team members 4

sofia-uni's activity

README