Hugging Face
MICHAEL A ALVES
wolverine604
0 followers · 21 following
shakenbake604
AI & ML interests
None yet
Recent Activity
reacted to burtenshaw's post with 👍 16 days ago
I made a real-time voice agent with FastRTC, smolagents, and Hugging Face inference providers. Check it out in this Space: 🔗 https://huggingface.co/spaces/burtenshaw/coworking_agent
reacted to singhsidhukuldeep's post with 👍 16 days ago
O1 Embedder: Transforming Retrieval Models with Reasoning Capabilities

Researchers from University of Science and Technology of China and Beijing Academy of Artificial Intelligence have developed a novel retrieval model that mimics the slow-thinking capabilities of reasoning-focused LLMs like OpenAI's O1 and DeepSeek's R1.

Unlike traditional embedding models that directly match queries with documents, O1 Embedder first generates thoughtful reflections about the query before performing retrieval. This two-step process significantly improves performance on complex retrieval tasks, especially those requiring intensive reasoning or zero-shot generalization to new domains.

The technical implementation is fascinating:
- The model integrates two essential functions: Thinking and Embedding
- It uses an "Exploration-Refinement" data synthesis workflow where initial thoughts are generated by an LLM and refined by a retrieval committee
- A multi-task training method fine-tunes a pre-trained LLM to generate retrieval thoughts via behavior cloning while simultaneously learning embedding capabilities through contrastive learning
- Memory-efficient joint training enables both tasks to share encoding results, dramatically increasing batch size

The results are impressive - O1 Embedder outperforms existing methods across 12 datasets in both in-domain and out-of-domain scenarios. For example, it achieves a 3.9% improvement on Natural Questions and a 3.0% boost on HotPotQA compared to models without thinking capabilities.

This approach represents a significant paradigm shift in retrieval technology, bridging the gap between traditional dense retrieval and the reasoning capabilities of large language models.

What do you think about this approach? Could "thinking before retrieval" transform how we build search systems?
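The "thinking before retrieval" idea in the post can be illustrated with a toy sketch. This is not the O1 Embedder implementation: the bag-of-words `embed` function stands in for a learned embedder, `generate_thought` is a hypothetical placeholder for the LLM thinking step (here a canned lookup), and joining the query and thought by simple concatenation is an assumption made only for illustration.

```python
import math
from collections import Counter

# Toy corpus (made up for this example).
DOCS = [
    "shakespeare wrote many tragedy plays",
    "who wrote the bible",
]

def embed(text):
    """Toy bag-of-words vector; a stand-in for a learned embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def generate_thought(query):
    """Hypothetical 'thinking' step: a real system would call an LLM here."""
    canned = {"who wrote hamlet": "hamlet is a tragedy play by shakespeare"}
    return canned.get(query.lower(), "")

def retrieve(query, docs, think=True):
    """Return the best-matching document, optionally enriching the query
    with a generated thought before embedding (assumed: concatenation)."""
    text = (query + " " + generate_thought(query)) if think else query
    query_vec = embed(text)
    return max(docs, key=lambda doc: cosine(query_vec, embed(doc)))
```

Calling `retrieve("who wrote hamlet", DOCS, think=False)` matches on surface words only, while `think=True` lets the generated thought pull in terms such as "shakespeare" and "tragedy" that the raw query lacks.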
reacted to jasoncorkill's post with 👍 16 days ago
Has OpenGVLab Lumina Outperformed OpenAI’s Model? We’ve just released the results from a large-scale human evaluation (400k annotations) of OpenGVLab’s newest text-to-image model, Lumina. Surprisingly, Lumina outperforms OpenAI’s DALL-E 3 in terms of alignment, although it ranks #6 in our overall human preference benchmark. To support further development in text-to-image models, we’re making our entire human-annotated dataset publicly available. If you’re working on model improvements and need high-quality data, feel free to explore. We welcome your feedback and look forward to any insights you might share! https://huggingface.co/datasets/Rapidata/OpenGVLab_Lumina_t2i_human_preference
Organizations
None yet
wolverine604's activity
liked a model about 1 month ago
caug37/TinyTim • Text Generation • Updated Feb 11, 2024 • 46 • 2
liked 3 models about 2 months ago
cognitivecomputations/dolphin-2.9-llama3-8b • Text Generation • Updated May 20, 2024 • 2.71k • 444
Tap-M/Luna-AI-Llama2-Uncensored • Text Generation • Updated Jul 26, 2023 • 2.39k • 143
Kijai/LivePortrait_safetensors • Updated Aug 2, 2024 • 76
liked a Space 9 months ago
OpenGPT 4o 🔥 • Runtime error • 4.33k
GPT 4o like bot.
liked a Space about 1 year ago
InstantID 😻 • Running on Zero • 3.27k
Generate personalized images with a face preservation
liked 2 models about 1 year ago
Sosaka/Alpaca-native-4bit-ggml • Updated Apr 6, 2023 • 207
TheBloke/Llama-2-13B-chat-GGML • Text Generation • Updated Sep 27, 2023 • 647 • 697