Spaces:

Samarth991
/

CV-Agent

Running

Samarth991 commited on Feb 23

Commit

b60841d

1 Parent(s): 5c2c847

added ultralytics

Files changed (1) hide show

app.py CHANGED Viewed

@@ -91,7 +91,8 @@ def main():
     st.session_state.visibility = "visible"
     st.title("Computer Vision Agent :sunglasses:")
-    st.markdown("Use the CV agent to do Object Detection/Panoptic Segementation/Image Segmentation/Image Descrption ")
     st.markdown(
     """
     <style>
@@ -107,12 +108,13 @@ def main():
         st.header("About Project")
         st.markdown(
             """
-            - Agent to filter images on basis multiple factors like image quality , object proportion in image , weather in the image .
-            - This application uses multiple tools like Image caption tool, DuckDuckGo search tool, Maskformer tool , weather predictor.
             """)
         st.sidebar.subheader("Upload Image !")
         option = st.sidebar.selectbox(
-            "Select the Large Language Model ",("deepseek-r1-distill-llama-70b",
                                                 "gemma2-9b-it",
                                                 "llama-3.2-3b-preview",
                                                 "llama-3.2-1b-preview",

     st.session_state.visibility = "visible"
     st.title("Computer Vision Agent :sunglasses:")
+    st.markdown("Use the CV agent to do Object Detection , Panoptic Segementation,Image Segmentation , Image Descrption task using the latest foundation models available opensource.")
+    st.markdown('The CV Agent implements an Agent that decide what and when to use to provide the information related to the image asked my the user.')
     st.markdown(
     """
     <style>
         st.header("About Project")
         st.markdown(
             """
+            - CV Agent can perform check on images to detemine the image quality and can also find out the segementaion mask and panoptic mask .
+            - This application uses multiple tools like Image caption tool, DuckDuckGo search tool, Maskformer tool , Panoptic segementation tool to perform these tasks.
+            - The decision on how to use the certain tool and when to use it soely relies on the Reasoning power of the LLM.
             """)
         st.sidebar.subheader("Upload Image !")
         option = st.sidebar.selectbox(
+            "Select your Large Language Model(LLM) ",("deepseek-r1-distill-llama-70b",
                                                 "gemma2-9b-it",
                                                 "llama-3.2-3b-preview",
                                                 "llama-3.2-1b-preview",