Migrated from using print statements in the application code to using logger module (left prints in files intended to be run as scripts) 2d7adb6 alfraser commited on Mar 4, 2024
Added page level comments to describe the purpose of each page 4f07f72 alfraser commited on Mar 4, 2024
Refactored the page into smaller functions to make it more modular and easier to follow and maintain. cfa39d8 alfraser commited on Mar 4, 2024
Added additional inline display of my dissertation on the dissertation page a898358 alfraser commited on Feb 23, 2024
Updated the architecture descriptions, images and caption text for the display of the architectures cc46ec6 alfraser commited on Feb 6, 2024
Added trendline option in the scatterplot and associated update to the project requirements file for deployent on HF spaces 943d243 alfraser commited on Feb 6, 2024
Updated from using random.choices to random.sample throughout where I need a random distinct set as choices does replacement so you can get the same item twice. Discovered in pricing testing. b897a48 alfraser commited on Feb 5, 2024
Added seoarate key for the question count as getting weird results in the counts f3f6cf6 alfraser commited on Feb 5, 2024
Added ability to set the number of testing threads dynamically from the UI fc8884e alfraser commited on Feb 5, 2024
Modified test runner to dispatch requests in parallel to make use of the fact that there is a lot of wait time for the LLM. Defaulting to 16 threads. bb7db2c alfraser commited on Feb 1, 2024
Added runner for pricing fact checks to assess the level of fact embedding in the latest model c319c31 alfraser commited on Feb 1, 2024
Saved test records and refactored reporter UI code into smaller functions a9d1d49 alfraser commited on Jan 31, 2024
Updated the testing page to show the request/response pairs 9cec719 alfraser commited on Jan 30, 2024
Added a push button to generate a random question to the UI, so users don't have to phrase something themselves. 7c479ac alfraser commited on Jan 26, 2024
Added the option to pause a failed endpoint in order to be able to kick it with a restart 5ecd875 alfraser commited on Jan 26, 2024
Added ability to select which models to compare side by side, allowing for more flexibility in testing my fine-tuned llamas 57b94ca alfraser commited on Jan 25, 2024
Added a test runner page which allows you to run a batch of test from the UI ab87be2 alfraser commited on Jan 24, 2024
Fixed issue in resolving sleep function caused by import time.time elsewhere f443824 alfraser commited on Jan 23, 2024
Removed decimals from group tag and moved where it is called in the side by side flow so they actually share the same ID 2f008c2 alfraser commited on Jan 23, 2024
Changed function name as getting a conflict on the server which I am not getting locally 59df961 alfraser commited on Jan 23, 2024
Added the display of the question typed into the side by side setup. d0a9930 alfraser commited on Jan 20, 2024
Removed debug print statements which cause extra chaff in the logs 1c63a8a alfraser commited on Jan 20, 2024
Updated to refresh every 10 seconds when an endpoint is starting up 38dd285 alfraser commited on Jan 19, 2024
Added a page to see and control (pause/resume) the model inference endpoints dfe6290 alfraser commited on Jan 19, 2024
Updated page as I learnt there should only be one final document 0d9d7ad alfraser commited on Jan 19, 2024
Removed now unused code which calls huggin face via the free HTTP interface - moved everything to endpoints to be equivalent. 227dc25 alfraser commited on Jan 19, 2024
Fixed a display issue in the markdown with dollar signs in prices aff284c alfraser commited on Jan 16, 2024
Added option to include a diagram of the architecture under test on the architectures page. 2db4636 alfraser commited on Jan 15, 2024
Added structure for the final dissertation work product (documents and video). e30a304 alfraser commited on Dec 8, 2023
Integrated the architecture page structure and code from the old codebase and integrated for hugging face deployment cc9a95f alfraser commited on Nov 29, 2023