AnkitAI (Ankit Aglawe)

upvoted a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 380

liked a model 5 months ago

AnkitAI/distilbert-base-uncased-financial-news-sentiment-analysis

Text Classification • Updated Nov 1, 2024 • 46 • 2

updated a model 5 months ago

AnkitAI/distilbert-base-uncased-financial-news-sentiment-analysis

Text Classification • Updated Nov 1, 2024 • 46 • 2

updated 3 models 9 months ago

reacted to Tar9897's post with 👍 9 months ago

Post

3294

As we advance on the path towards true Artificial General Intelligence (AGI), it's crucial to recognize and address the limitations inherent in current technologies, particularly in large language models (LLMs) like those developed by OpenAI. While LLMs excel in processing and generating text, their capabilities are largely constrained to the domains of natural language understanding and generation. This poses significant limitations when dealing with more complex, abstract mathematical concepts such as topological analysis, 3D geometry, and homotopy type theory.

Topological Analysis and 3D Geometry: LLMs currently do not possess the inherent ability to understand or interpret the spatial and geometric data that is critical in fields like robotics, architecture, and advanced physics. These models lack the capacity to visualize or manipulate three-dimensional objects or comprehend the underlying properties that govern these forms.

Homotopy Type Theory is a branch of mathematics that combines homotopy theory and type theory. Homotopy type theory provides tools for a more robust handling of equivalences and transformations, something that LLMs are not designed to handle directly.

For the development of AGI, it is not sufficient to merely enhance existing models' capacities within their linguistic domains. Instead, a synthesis of symbolic AI with an understanding of homotopy type theory could pave the way. Symbolic AI, which manipulates symbols and performs logical operations, when combined with the abstract mathematical reasoning of homotopy type theory, could lead to breakthroughs in how machines understand and interact with the world.

To address these limitations we have developed Tenzin, which is a one-of-a-kind model with a planned release date within the next 1-2 weeks . To learn more join the waitlist at https://octave-x.com/.

9 replies

·

liked a model 9 months ago

marathi-llm/MahaMarathi-7B-v24.01-Base

Text Generation • Updated Feb 21, 2024 • 81 • 36

liked a dataset 9 months ago

cardiffnlp/tweet_eval

Viewer • Updated Jan 4, 2024 • 201k • 19.8k • 123

reacted to gokaygokay's post with 👍 9 months ago

Post

3022

I've created a Stable Diffusion 3 (SD3) image generation space for convenience. Now you can:

1. Generate SD3 prompts from images
2. Enhance your text prompts (turn 1-2 words into full SD3 prompts)

https://huggingface.co/spaces/gokaygokay/SD3-with-VLM-and-Prompt-Enhancer

These features are based on my custom models:

- VLM captioner for prompt generation:
- gokaygokay/sd3-long-captioner

- Prompt Enhancers for SD3 Models:
- gokaygokay/Lamini-Prompt-Enchance-Long
- gokaygokay/Lamini-Prompt-Enchance

You can now simplify your SD3 workflow with these tools!

reacted to nikgr's post with 🔥 9 months ago

Post

1562

🐦 Do you remember IBIS? Not a fancy bird but the open challenge in Inferring Binding Specificities of unexplored human transcription factors. Check our site (https://ibis.autosome.org/) and have a sip of fresh news below.

👥 More than 100 teams registered for the challenge yet only two dozen are using the opportunity to explore their models on the Leaderboard. Don't miss the chance to participate in the Leaderboard stage, although independently of that you can submit the final solution.

🌐 Remember, the training data for Leaderboard and Final are available online, and you are free to mix-and-match it in any combination.

🌌 For Leaderboard, we have received 650 total submissions of AAA (advanced ML) and 296 PWM models (a whopping set of 6682 PWMs in total).

🚀 For PWMs, the baseline is left far behind, but some TFs remain tough nuts to be cracked (see the attached figure 1).

📈 For AAAs, there is a solid improvement over the best-submitted PWMs in A2G, but the G2A discipline remains unpopular (see the attached figure 2). Free hint: this is your chance!

💡 Another free hint: If your model tends to overfit given a limited set of data for some TFs don't forget to use reverse-complement and shift augmentations. Also, don't hesitate to use multitarget models i.e. predicting the binding of multiple TFs at the same time.

💡 Last but not least, try to combine knowledge from all accessible experiment types, especially for G2A discipline (ChIP-Seq & genomic HT-SELEX) in a single model!

📣 Finally and importantly, following the requests from the community, we decided to EXTEND the Leaderboard until the final submission deadline.

🗓️ The final submission deadline is also EXTENDED until Aug 15. The final submission form and details will be posted on the IBIS website in the first half of July, follow our Telegram group and mailing list (see the links at https://ibis.autosome.org).

reacted to m-ric's post with 🔥 9 months ago

Post

2705

𝗬𝗼𝘂 𝗱𝗼𝗻'𝘁 𝗻𝗲𝗲𝗱 "𝗳𝘂𝗻𝗰𝘁𝗶𝗼𝗻 𝗰𝗮𝗹𝗹𝗶𝗻𝗴 𝗳𝗶𝗻𝗲-𝘁𝘂𝗻𝗶𝗻𝗴" 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗴𝗼𝗼𝗱 𝗮𝗴𝗲𝗻𝘁𝘀 ⛔

It's trendy to share models "fine-tuned for function calling"; but from my observations, this fine-tuning is not necessary or sufficient to build good agent systems.
To name only a few:
🐦‍⬛ Nexusflow/𝗡𝗲𝘅𝘂𝘀𝗥𝗮𝘃𝗲𝗻-𝗩𝟮-𝟭𝟯𝗕
⌘ CohereForAI/𝗰𝟰𝗮𝗶-𝗰𝗼𝗺𝗺𝗮𝗻𝗱-𝗿-𝗽𝗹𝘂𝘀
⛵️ mistralai/𝗠𝗶𝘅𝘁𝗿𝗮𝗹-𝟴𝘅𝟮𝟮𝗕-𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁-𝘃𝟬.𝟭
"Fine-tuned for function-calling" generally means "fine-tuned to generate function calls in correct JSON for extremely simple tasks". In other terms, it means "improve the formatting of the tool calls".

Yet I discovered two things while improving Transformers Agents:
🧐 Even when used as JSON agents, these fine-tuned models don't perform very well
🏅 𝙂𝙤𝙤𝙙 𝙗𝙖𝙨𝙚 𝙢𝙤𝙙𝙚𝙡𝙨 𝙥𝙚𝙧𝙛𝙤𝙧𝙢 𝙗𝙚𝙩𝙩𝙚𝙧 𝙬𝙞𝙩𝙝𝙤𝙪𝙩 𝙖𝙣𝙮 𝙛𝙞𝙣𝙚-𝙩𝙪𝙣𝙞𝙣𝙜, 𝙟𝙪𝙨𝙩 𝙥𝙡𝙖𝙞𝙣 𝙥𝙧𝙤𝙢𝙥𝙩𝙞𝙣𝙜. (Llama-3-70B-Instruct, GPT-4o, Claude-3.5-Sonnet)

👇 The graph below shows the count of errors for my GPT-4o validation run on the GAIA benchmark: 𝙰𝚐𝚎𝚗𝚝𝙿𝚊𝚛𝚜𝚒𝚗𝚐𝙴𝚛𝚛𝚘𝚛 and 𝙰𝚐𝚎𝚗𝚝𝙴𝚡𝚎𝚌𝚞𝚝𝚒𝚘𝚗𝙴𝚛𝚛𝚘𝚛 are the ones caused by incorrect formatting.
➤ As you can see, their count is already close to 0!
And given that GPT-4o is certainly not fine-tuned for our Code tool calling format, this shows that "function calling fine-tuning" is not necessary!

The hardest thing to get right in an agent is still to 𝙥𝙡𝙖𝙣 𝙜𝙤𝙤𝙙 𝙩𝙖𝙨𝙠-𝙨𝙤𝙡𝙫𝙞𝙣𝙜 𝙩𝙧𝙖𝙟𝙚𝙘𝙩𝙤𝙧𝙞𝙚𝙨 𝙤𝙫𝙚𝙧 𝙨𝙚𝙫𝙚𝙧𝙖𝙡 𝙨𝙩𝙚𝙥𝙨.
To improve this, we could:
- Use more powerful base models
- Make tool calling datasets with complex solving trajectories
- Use RL! cc @lvwerra

3 replies

·

replied to DmitryRyumin's post 9 months ago

This looks awesome!

reacted to DmitryRyumin's post with 🔥 9 months ago

Post

2060

🔥🎭🌟 New Research Alert - ECCV 2024 (Avatars Collection)! 🌟🎭🔥
📄 Title: Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture 🔝

📝 Description: Topo4D is a novel method for automated, high-fidelity 4D head tracking that optimizes dynamic topological meshes and 8K texture maps from multi-view time-series images.

👥 Authors: @Dazz1e , Y. Cheng, @Ryan-sjtu , H. Jia, D. Xu, W. Zhu, Y. Yan

📅 Conference: ECCV, 29 Sep – 4 Oct, 2024 | Milano, Italy 🇮🇹

📄 Paper: Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture (2406.00440)

🌐 Github Page: https://xuanchenli.github.io/Topo4D/
📁 Repository: https://github.com/XuanchenLi/Topo4D

🚀 CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

🚀 WACV-2024-Papers: https://github.com/DmitryRyumin/WACV-2024-Papers

🚀 ICCV-2023-Papers: https://github.com/DmitryRyumin/ICCV-2023-Papers

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #Topo4D #4DHead #3DModeling #4DCapture #FacialAnimation #ComputerGraphics #MachineLearning #HighFidelity #TextureMapping #DynamicMeshes #GaussianSplatting #VisualEffects #ECCV2024

1 reply

·

liked a dataset 9 months ago

AnkitAI/CategorizedTextReviews

Viewer • Updated Jul 1, 2024 • 7k • 23 • 4

liked 2 Spaces 9 months ago

1

AnkitAI Deberta V3 Small Base Emotions Classifier

📊

1

Deberta Xlarge Base Emotions Classifier

🏃

updated a dataset 10 months ago

AnkitAI/CategorizedTextReviews

Viewer • Updated Jul 1, 2024 • 7k • 23 • 4

reacted to merve's post with 🔥 10 months ago

Post

3292

Just shipped: introduction to vision language models (aka image-text-to-text) https://huggingface.co/tasks/image-text-to-text

Learn about more machine learning tasks at https://huggingface.co/tasks

liked a dataset 10 months ago

cardiffnlp/super_tweeteval

Viewer • Updated Jul 30, 2024 • 255k • 956 • 11

Ankit Aglawe

AI & ML interests

Recent Activity

Organizations

AnkitAI's activity

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

AnkitAI/distilbert-base-uncased-financial-news-sentiment-analysis

AnkitAI/distilbert-base-uncased-financial-news-sentiment-analysis

AnkitAI/deberta-xlarge-base-emotions-classifier

AnkitAI/reviews-roberta-base-sentiment-analysis

AnkitAI/deberta-v3-small-base-emotions-classifier

marathi-llm/MahaMarathi-7B-v24.01-Base

cardiffnlp/tweet_eval

AnkitAI/CategorizedTextReviews

AnkitAI Deberta V3 Small Base Emotions Classifier

Deberta Xlarge Base Emotions Classifier

AnkitAI/CategorizedTextReviews

cardiffnlp/super_tweeteval