
Marian Kannwischer PRO

canwiper

AI & ML interests

RLHF & Computer Vision

Recent Activity


Organizations

mlo-data-cleaning, mlo-data-collab, Rapidata

canwiper's activity

reacted to jasoncorkill's post with πŸ”₯ 9 days ago
πŸš€ We tried something new!

We just published a dataset using a new (for us) preference modality: direct ranking based on aesthetic preference. We ranked a couple of thousand images from most to least preferred, all sampled from the Open Image Preferences v1 dataset by the amazing @data-is-better-together team.

πŸ“Š Check it out here:
Rapidata/2k-ranked-images-open-image-preferences-v1

We're really curious to hear your thoughts!
Is this kind of ranking interesting or useful to you? Let us know! πŸ’¬

If it is, please consider leaving a ❀️ and if we hit 30 ❀️s, we’ll go ahead and rank the full 17k image dataset!
reacted to jasoncorkill's post with πŸ”₯ 11 days ago
πŸ”₯ Yesterday was a fire day!
We dropped two brand-new datasets capturing human preferences for text-to-video and text-to-image generations, powered by our own crowdsourcing tool!

Whether you're working on model evaluation, alignment, or fine-tuning, this is for you.

1. Text-to-Video Dataset (Pika 2.2 model):
Rapidata/text-2-video-human-preferences-pika2.2

2. Text-to-Image Dataset (Reve-AI Halfmoon):
Rapidata/Reve-AI-Halfmoon_t2i_human_preference

Let’s train AI on AI-generated content with humans in the loop.
Let’s make generative models that actually get us.
reacted to jasoncorkill's post with πŸ‘€ about 1 month ago
At Rapidata, we compared DeepL with LLMs like DeepSeek-R1, Llama, and Mixtral on translation quality, using feedback from over 51,000 native speakers. Despite its cost, DeepL's performance makes it a valuable investment, especially in critical applications where translation quality is paramount. Now we can say that Europe offers more than imposing regulations.

Our dataset, based on these comparisons, is now available on Hugging Face. This might be useful for anyone working on AI translation or language model evaluation.

Rapidata/Translation-deepseek-llama-mixtral-v-deepl
  • 1 reply
Β·
reacted to jasoncorkill's post with πŸ‘€ about 1 month ago
Benchmarking Google's Veo2: How Does It Compare?

Google recently launched Veo2, its latest text-to-video model, through select partners like fal.ai. As part of our ongoing evaluation of state-of-the-art generative video models, we rigorously benchmarked Veo2 against industry leaders.

The results did not meet expectations. Veo2 struggled with style consistency and temporal coherence, falling behind competitors like Runway, Pika, Tencent, and even Alibaba. While the model shows promise, its alignment and quality are not yet there.

We generated a large set of Veo2 videos, spending hundreds of dollars in the process, and systematically evaluated them using our Python-based API for human and automated labeling.

Check out the ranking here: https://www.rapidata.ai/leaderboard/video-models

Rapidata/text-2-video-human-preferences-veo2
reacted to jasoncorkill's post with πŸ”₯ about 2 months ago
The Sora Video Generation Aligned Words dataset contains a collection of word segments for text-to-video and other multimodal research. It is intended to help researchers and engineers explore fine-grained prompts, including those where certain words are not aligned with the video.

We hope this dataset will support your work in prompt understanding and advance progress in multimodal projects.

If you have specific questions, feel free to reach out.
Rapidata/sora-video-generation-aligned-words
reacted to jasoncorkill's post with πŸ‘€ 2 months ago
Runway Gen-3 Alpha: The Style and Coherence Champion

Runway's latest video generation model, Gen-3 Alpha, is something special. It ranks #3 overall on our text-to-video human preference benchmark, but in terms of style and coherence, it outperforms even OpenAI Sora.

However, it struggles with alignment, making it less predictable for controlled outputs.

We've released a new dataset with human evaluations of Runway Gen-3 Alpha: Rapidata's text-2-video human preferences dataset. If you're working on video generation and want to see how your model compares to the biggest players, we can benchmark it for you.

πŸš€ DM us if you’re interested!

Dataset: Rapidata/text-2-video-human-preferences-runway-alpha
  • 1 reply
Β·
reacted to jasoncorkill's post with πŸ”₯ 3 months ago
We benchmarked @xai-org 's Aurora model, in what is, as far as we know, the first public evaluation of the model at scale.

We collected 401k human annotations over the past ~2 days for this, and we have uploaded all of the annotation data here on Hugging Face with a fully permissive license:
Rapidata/xAI_Aurora_t2i_human_preferences
  • 1 reply
Β·