Spaces:
Sleeping
Sleeping
Commit History
AWS credentials no longer a requirement for app to work
30b5dc1
Gradio 4.21. Limitations on file size and creating embeddings. Added AWS integration
e0fe055
Removed GRADIO_ROOT_PATH from the Dockerfile as it is not necessary for API Gateway access
7e9dd76
Sean-Case
commited on
Updated to latest Gradio version that hopefully solved root_url issues. Added ssl_verify=False to launch
813f823
Sean-Case
commited on
Set SDK version to 4.20.0 on huggingface. Added GRADIO_SERVER_PORT to Dockerfile
13cab70
Sean-Case
commited on
set gradio version to 4.18.0, before root path issue reappears
8ba34be
Sean-Case
commited on
Changed dockerfile gradio_root_path variable
745a159
Sean-Case
commited on
String query not correctly specified in fuzzy search, changed this
ff8dfa3
Sean-Case
commited on
ECS not allowing me to save files so increasing container privileges in Dockerfile
9c0a094
Sean-Case
commited on
set Dockerfile Gradio_root_path environmental variable
29ce26a
Sean-Case
commited on
Now loads in embedding model locally in Dockerfile
3034296
Sean-Case
commited on
edited Dockerfile slightly
55b0541
Sean-Case
commited on
Added small line to dockerfile start. Added dockerignore file for local builds
d99f440
Sean-Case
commited on
Improved code for cleaning and outputting files. Added Dockerfile
4ee3470
Sean-Case
commited on
Improved xlsx output formatting. Deals better with cleaning data then analysing in same session.
352c02a
Sean-Case
commited on
Added highlight search term functionality to keyword search output
36a404e
Updated to Gradio 4.16.0. Now works correctly with BGE embeddings
2bcd818
Upgraded to Gradio 4.16.0. Added Spacy fuzzy search functionality.
4ce2224
Sean-Case
commited on
Changed intro text
8c115b3
Sean-Case
commited on
Changed requirements to download correct spacy model version
4e77f14
Sean-Case
commited on
Cut out semantic search temporarily while issues with Jina gated model resolved. Improved error/progress tracking and messaging. Placeholder for Spacy fuzzy search.
739b386
Switch back to Gradio version 3.50.2 as 4 is not working with pyinstaller app creation yet
c6dc87d
Sean-Case
commited on
Better error checking. Doesn't load in embeddings file twice now.
63049fe
Sean-Case
commited on
Fixed data input for semantic search. Allowed for docs to be loaded in directly for semantic search. 0.2.1
3df8e40
Sean-Case
commited on
Minor changes to file path for outputs, documentation, location of pyinstaller build dependencies
200480d
Added accelerate to requirements. Minor doc changes
f2db299
Many changes to code organisation. More efficient searches from using intermediate outputs. Version 0.1
99d6fba
Now works correctly with npz. Minor formatting improvements
d3b1ac5
Changed in_query to keyword_query
6768a6d
Added polars to requirements
25489e9
Added semantic search using Jina
ceb8617
Faster embedding with GPU, fast document split, writes to chromadb file correctly. No longer needs FAISS or langchain
2cb9977
Now outputs correct dataframe for semantic search. Can join on extra details
2a8aba8
Sean-Case
commited on
Added to requirements: transformers
acfac99
Sean-Case
commited on
Added basic semantic search functionality
78d71d4
Sean-Case
commited on
added nltk punkt load
ba838fc
Sean-Case
commited on
Added stopwords and wordnet nltk dependencies
c1da670
Sean-Case
commited on
Added nltk download for names for HF use
7e5fca9
Sean-Case
commited on
Updates to readme file. Changed app file name to work with HF.
5c04910
Sean-Case
commited on
Initial commit
a9c2120
Sean-Case
commited on
Create sync_to_hf.yml
589448e
unverified
Sean Pedrick-Case
commited on
Initial commit
c2eff10
unverified
Sean Pedrick-Case
commited on