Update notebook
Browse files
notebook
CHANGED
@@ -1,12 +1,4 @@
|
|
1 |
-
#
|
2 |
-
Viviane da Silva Dilly
|
3 |
-
|
4 |
-
In this notebook, I did part 2 and 3 of fine tuning a LLM (I chose GPT2) on the enron dataset from kaggle.
|
5 |
-
|
6 |
-
* Fine-tune a Language Model on the Enron dataset
|
7 |
-
* Create a Gradio Interface that answers questions related to the case deploying it in a Huggingface Space
|
8 |
-
|
9 |
-
As I mentioned below on my code, after many many days of trying to use the whole data set and having my code crashing after long hours of waiting, I decided to use a sample.
|
10 |
|
11 |
# I'll start by installing and importing all I need
|
12 |
|
@@ -118,8 +110,5 @@ def generate_response(question):
|
|
118 |
# Finally I'll create Gradio interface
|
119 |
gr.Interface(generate_response, "textbox", "textbox", title="Ask Enron Dataset", description="Enter a question about the case").launch()
|
120 |
|
121 |
-
|
122 |
-
|
123 |
-
(you find the link also on the top of this notebook)
|
124 |
-
|
125 |
-
#[My model deployed on Hugging Face](https://huggingface.co/vividsd/gpt2-enrondataset)
|
|
|
1 |
+
# As I mentioned below on my code, after many many days of trying to use the whole data set and having my code crashing after long hours of waiting, I decided to use a sample.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
|
3 |
# I'll start by installing and importing all I need
|
4 |
|
|
|
110 |
# Finally I'll create Gradio interface
|
111 |
gr.Interface(generate_response, "textbox", "textbox", title="Ask Enron Dataset", description="Enter a question about the case").launch()
|
112 |
|
113 |
+
# I experimented with the chatbot and it starts answering well, but then repeats the same sentence over and over in many situations
|
114 |
+
# I couldn't fix the situation, probably solved by text preprocessing
|
|
|
|
|
|