Sambit Mukherjee PRO

sadhaklal

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face Discord Community's profile picture

sadhaklal's activity

New activity in discord-community/LevelBot 7 days ago
commented on Tool Use, Unified 11 days ago
view reply

Excellent article. Very clearly written.

I have one question though. It seems that that model replies with either (i) a text response or (ii) a tool call. However, in the original ReAct paper, there is a "Thought" -> "Action" -> "Observation" cycle. In other words, in response to the user's query, the model first outputs a "Thought" followed by an "Action". How do I implement this (i.e., make the model "think" before performing a tool call)?

The following are the original ReAct prompts for HotpotQA (from the official ReAct GitHub repo): https://raw.githubusercontent.com/ysymyth/ReAct/refs/heads/master/prompts/prompts_naive.json

If you examine these prompts, you'll notice that the "thoughts" come before the "actions".

upvoted an article 13 days ago
posted an update 13 days ago
view post
Post
1772
What happens when you combine the Chain of Thought (CoT) reasoning capabilities of LLMs with a heuristic-guided tree search algorithm? In the Tree of Thoughts (ToT) paper, the authors (Yao et al.) have coupled GPT-4 with tree search algorithms to attack a few tasks on which left-to-right CoT struggles. And the results are impressive. For example, on the "Game of 24" task, while GPT-4 with CoT prompting only managed to solve 4% of tasks, ToT achieved a success rate of 74%.

I've written a blog post that makes the ToT paper easy to understand and implement by taking you through all the details in a step-by-step manner: https://huggingface.co/blog/sadhaklal/tree-of-thoughts

If you are interested in the topics of algorithmic AI, tree search, reasoning, planning, or "System 2" thinking, then you may find this blog post useful.
published an article 14 days ago
view article
Article

Understanding and Implementing the Tree of Thoughts Paradigm

By sadhaklal
8