2 2 7

Jay Broughton

Kerassy

https://www.instabot.co.uk

kerassy

AI & ML interests

NLP, dataset creation, small language and multi-modal models.

Recent Activity

updated a Space 23 days ago

Kerassy/news_summarizer

published a Space 23 days ago

Kerassy/news_summarizer

updated a model about 2 months ago

Kerassy/Qwen2.5-1.5b-Instruct-Dolly

View all activity

Organizations

None yet

Kerassy's activity

updated a Space 23 days ago

News Summarizer

👀

Demo BART summarizer for short news articles

published a Space 23 days ago

News Summarizer

👀

Demo BART summarizer for short news articles

updated a model about 2 months ago

Kerassy/Qwen2.5-1.5b-Instruct-Dolly

Updated Feb 10

published a model about 2 months ago

Kerassy/Qwen2.5-1.5b-Instruct-Dolly

Updated Feb 10

liked a dataset about 2 months ago

kaist-ai/CoT-Collection

Viewer • Updated Oct 14, 2023 • 1.84M • 1.26k • 142

upvoted a paper 2 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 60

updated a dataset 2 months ago

Kerassy/misc-art-and-descriptons

Viewer • Updated Jan 30 • 1.5k • 35 • 1

reacted to clem's post with 🔥 2 months ago

Post

7252

AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!

updated 3 datasets 2 months ago

reacted to hexgrad's post with ❤️ 2 months ago

Post

3974

IMHO, being able & willing to defeat CAPTCHA, hCaptcha, or any other reasoning puzzle is a must-have for any Web-Browsing / Computer-Using Agent (WB/CUA).

I realize it subverts the purpose of CAPTCHA, but I do not think you can claim to be building AGI/agents without smoothly passing humanity checks. It would be like getting in a self-driving car that requires human intervention over speed bumps. Claiming AGI or even "somewhat powerful AI" seems hollow if you are halted by a mere CAPTCHA.

I imagine OpenAI's Operator is *able* but *not willing* to defeat CAPTCHA. Like their non-profit status, I expect that policy to evolve over time—and if not, rival agent-builders will attack that opening to offer a better product.

2 replies

liked a dataset 2 months ago

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 21.3k • 195

reacted to andito's post with 🚀 2 months ago

Post

1640

𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝘁𝗵𝗲 𝘄𝗼𝗿𝗹𝗱'𝘀 𝘀𝗺𝗮𝗹𝗹𝗲𝘀𝘁 𝘃𝗶𝘀𝗶𝗼𝗻 𝗹𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗺𝗼𝗱𝗲𝗹!

We’re thrilled to share 𝗦𝗺𝗼𝗹𝗩𝗟𝗠 (256M & 500M)—the smallest Visual Language Models ever built. Think: running on <1GB of GPU memory—you can fine-tune it on your laptop and run it on your toaster!

Why It’s Game-Changing:
- 𝗢𝘂𝘁𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝘀 𝗟𝗮𝗿𝗴𝗲𝗿 𝗠𝗼𝗱𝗲𝗹𝘀: Even the 256M model surpasses our SOTA 80B-parameter model from just 17 months ago. Over 300x reduction!
𝗠𝗶𝗴𝗵𝘁𝘆 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆: The 256M version delivers 80% of our 2.2B model’s performance, and the 500M version hits 90%
𝗟𝗶𝗴𝗵𝘁𝗻𝗶𝗻𝗴-𝗙𝗮𝘀𝘁 𝗦𝗲𝗮𝗿𝗰𝗵: SmolVLM integrates with ColiPali for state-of-the-art retrieval speeds—on par with models 10x bigger. That means cheaper, faster indexing and real-world impact.

What’s New Under the Hood:
- 𝗡𝗲𝘄 𝗩𝗶𝘀𝗶𝗼𝗻 𝗘𝗻𝗰𝗼𝗱𝗲𝗿: Smaller overall size (400M -> 93M), but with higher resolution.
- 𝗛𝗶𝗴𝗵𝗲𝗿 𝗣𝗶𝘅𝗲𝗹𝘀/𝗧𝗼𝗸𝗲𝗻: 4096 vs. 1820—more efficient image processing.
- 𝗦𝗺𝗮𝗿𝘁 𝗧𝗼𝗸𝗲𝗻𝗶𝘇𝗮𝘁𝗶𝗼𝗻: Faster training and a performance boost.

Check our blog: https://huggingface.co/blog/smolervlm
The models: HuggingFaceTB/smolvlm-256m-and-500m-6791fafc5bb0ab8acc960fb0
The demo: HuggingFaceTB/SmolVLM-256M-Demo

1 reply

updated a dataset 3 months ago

Kerassy/misc-art-and-descriptons

Viewer • Updated Jan 30 • 1.5k • 35 • 1

New activity in Kerassy/misc-art-and-descriptons 3 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 3 months ago by

librarian-bot