Ultronprime committed on
Commit 09bdbf7 · verified · 1 Parent(s): 8974eaf

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +27 -9
README.md CHANGED
@@ -1,25 +1,43 @@
  ---
- title: Cloud RAG Webhook
+ title: RAG Document Processing
  emoji: 📄
  colorFrom: blue
  colorTo: green
  sdk: docker
  app_port: 7860
  pinned: false
+ models:
+ - sentence-transformers/all-MiniLM-L6-v2
+ - facebook/bart-large-cnn
  license: apache-2.0
  ---

- # Cloud RAG Webhook
+ # RAG Document Processing

- This Space contains a RAG (Retrieval Augmented Generation) system for processing documents.
- It was migrated from the GitHub repository: https://github.com/Daanworg/cloud-rag-webhook.git
+ This Hugging Face Space contains a RAG (Retrieval Augmented Generation) system for processing documents, built entirely with Hugging Face tools and models.

  ## Features

  - Automated processing of text files
- - Document AI integration
- - BigQuery storage for document chunks
- - Vertex AI for embeddings and search
- - GitHub backup integration
+ - OCR and document processing with HF models
+ - Vector storage with FAISS
+ - Embeddings with sentence-transformers
+ - Simple query interface with Gradio

- See the README_AUTOMATION.md file for details on how to use this system.
+ ## Implementation
+
+ This is a Hugging Face-native implementation of the original cloud-based RAG system. It uses:
+
+ - Sentence Transformers for embeddings
+ - FAISS for vector search
+ - Transformers models for document processing
+ - Persistent storage for data
+ - A100 GPU acceleration (for pro users)
+
+ ## Usage
+
+ 1. Upload documents to be processed
+ 2. The system automatically processes and indexes them
+ 3. Query the processed documents through the interface
+
+ See the CLAUDE_HF.md file for implementation details.
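
The three usage steps in the new README (upload, process and index, query) can be sketched in a few lines. The block below is a minimal, standard-library-only illustration and not the Space's actual code. `chunk_text`, `embed`, `cosine`, and `ToyIndex` are hypothetical names. The bag-of-words `embed` is only a stand-in for a sentence-transformers model such as `all-MiniLM-L6-v2`, and the exhaustive cosine scan is a stand-in for a FAISS index.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=50):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: L2-normalised bag-of-words counts as a sparse dict.

    Assumption: this replaces a real sentence-embedding model purely so
    the sketch runs without downloads.
    """
    counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {token: c / norm for token, c in counts.items()}


def cosine(a, b):
    """Cosine similarity of two normalised sparse vectors (their dot product)."""
    if len(a) > len(b):
        a, b = b, a  # iterate over the smaller vector
    return sum(weight * b.get(token, 0.0) for token, weight in a.items())


class ToyIndex:
    """In-memory index: stores chunks and vectors, searches exhaustively."""

    def __init__(self):
        self.chunks = []
        self.vectors = []

    def add_document(self, text, max_words=50):
        # Steps 1 and 2: accept a document, chunk it, embed and store each chunk.
        for chunk in chunk_text(text, max_words):
            self.chunks.append(chunk)
            self.vectors.append(embed(chunk))

    def query(self, question, k=3):
        # Step 3: embed the question and return the k most similar chunks.
        q = embed(question)
        scored = [(cosine(q, v), c) for v, c in zip(self.vectors, self.chunks)]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


if __name__ == "__main__":
    index = ToyIndex()
    index.add_document("FAISS stores vectors for similarity search.")
    index.add_document("Gradio builds simple web interfaces.")
    for score, chunk in index.query("vector similarity search", k=1):
        print(round(score, 3), chunk)
```

Swapping the toy pieces for the components the README names would keep the same shape: the embedding function returns dense vectors from the model, and the list scan becomes a nearest-neighbour lookup in the vector index.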