Yep, that’s the core of it! For example, if I use the prompt “Paris is the capital of France” and then highlight “of” in my prompt, the layer predictions tab will show you what the model believes the next token to be at each layer.
You can watch the model start its guess in the very first layer (usually with something completely irrelevant), and then, as it progresses through each layer, get closer and closer until it converges on “France” as the most likely next token based on the context leading up to the selected token “of.” So the model basically interprets it as “Paris is the capital of -> ? -> France.”
You can see that in the 1st layer the model was thinking “Paris is the capital of ales,” then at a deeper layer “Paris is the capital of guardians,” before it finally ended in the last layer with the correct prediction (again, based on “Paris is the capital of”): “France.”
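Under the hood this per-layer readout is usually done “logit lens” style: take the hidden state at each layer, project it through the model’s unembedding matrix, and softmax to get a next-token distribution. Here’s a toy numpy sketch of that idea (made-up vocab, random weights, and hand-built hidden states that mirror the ales -> guardians -> France example; this is not the tool’s actual code):

```python
import numpy as np

# Toy vocabulary; the real tool uses the model's tokenizer.
vocab = ["ales", "guardians", "France", "Paris"]

rng = np.random.default_rng(0)
d_model = 32

# Unembedding matrix: maps a hidden state to one logit per vocab token.
W_U = rng.normal(size=(d_model, len(vocab)))

# Pretend hidden states for the selected token "of" at three layers.
# Each one is nudged toward a different vocab column, mirroring the
# early -> middle -> final predictions described above.
hidden_states = [
    W_U[:, 0] + 0.1 * rng.normal(size=d_model),  # early layer -> "ales"
    W_U[:, 1] + 0.1 * rng.normal(size=d_model),  # deeper layer -> "guardians"
    W_U[:, 2] + 0.1 * rng.normal(size=d_model),  # last layer -> "France"
]

def logit_lens(h, W_U):
    """Project a hidden state through the unembedding and softmax it."""
    logits = h @ W_U
    probs = np.exp(logits - logits.max())  # stable softmax
    return probs / probs.sum()

for layer, h in enumerate(hidden_states):
    probs = logit_lens(h, W_U)
    print(f"layer {layer}: {vocab[int(np.argmax(probs))]}")
```

In a real model the hidden states come from a forward pass (e.g. `output_hidden_states=True` in HuggingFace transformers), but the projection step is the same.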
The entropy tab calculates a few different metrics that also give token-level and prompt-level hallucination risk assessments, so you can see which tokens or prompts are higher risk for inducing hallucination in that particular model.