ml6team
/

keyphrase-extraction-kbir-kpcrowd

@@ -85,18 +85,19 @@ class KeyphraseExtractionPipeline(TokenClassificationPipeline):
 ```python
 # Load pipeline
-model_name = "DeDeckerThomas/keyphrase-extraction-kbir-kpcrowd"
 extractor = KeyphraseExtractionPipeline(model=model_name)
 ```
 ```python
 # Inference
 text = """
 Keyphrase extraction is a technique in text analysis where you extract the important keyphrases from a text.
-Since this is a time-consuming process, Artificial Intelligence is used to automate it.
-Currently, classical machine learning methods, that use statistics and linguistics, are widely used for the extraction process.
-The fact that these methods have been widely used in the community has the advantage that there are many easy-to-use libraries.
-Now with the recent innovations in deep learning methods (such as recurrent neural networks and transformers, GANS, …),
-keyphrase extraction can be improved. These new methods also focus on the semantics and context of a document, which is quite an improvement.
 """.replace(
     "\n", ""
 )
@@ -108,14 +109,18 @@ print(keyphrases)
 ```
 # Output
-['Artificial Intelligence' 'GANS' 'Keyphrase extraction'
- 'classical machine learning' 'deep learning methods'
- 'keyphrase extraction' 'linguistics' 'recurrent neural networks'
- 'semantics' 'statistics' 'text analysis' 'transformers']
 ```
 ## 📚 Training Dataset
-KPCrowd is a keyphrase a broadcast news transcription dataset consisting of 500 English broadcast news stories from 10 different categories (art and culture, business, crime, fashion, health, politics us, politics world, science, sports, technology) with 50 docs per category. This dataset is annotated by multiple annotators that were required to look at the same news story and assign a set of keyphrases from the text itself.
 You can find more information here: https://huggingface.co/datasets/midas/kpcrowd and https://github.com/LIAAD/KeywordExtractor-Datasets.
@@ -218,4 +223,4 @@ The model achieves the following results on the Inspec test set:
 For more information on the evaluation process, you can take a look at the keyphrase extraction evaluation notebook.
 ## 🚨 Issues
-Please feel free to contact Thomas De Decker for any problems with this model.

 ```python
 # Load pipeline
+model_name = "ml6team/keyphrase-extraction-kbir-kpcrowd"
 extractor = KeyphraseExtractionPipeline(model=model_name)
 ```
 ```python
 # Inference
 text = """
 Keyphrase extraction is a technique in text analysis where you extract the important keyphrases from a text.
+Since this is a time-consuming process, Artificial Intelligence is used to automate it.
+Currently, classical machine learning methods, that use statistics and linguistics,
+are widely used for the extraction process. The fact that these methods have been widely used in the community
+has the advantage that there are many easy-to-use libraries. Now with the recent innovations in NLP,
+transformers can be used to improve keyphrase extraction. Transformers also focus on the semantics
+and context of a document, which is quite an improvement.
 """.replace(
     "\n", ""
 )
 ```
 # Output
+['Artificial Intelligence', 'Keyphrase extraction', 'NLP',
+ 'Transformers also', 'advantage', 'automate',
+ 'classical machine learning', 'community', 'context', 'document',
+ 'extract', 'extraction', 'extraction process', 'focus',
+ 'important', 'improvement', 'innovations', 'keyphrase',
+ 'keyphrases', 'libraries', 'linguistics', 'methods', 'process',
+ 'recent', 'semantics', 'statistics', 'technique', 'text',
+ 'text analysis', 'time-consuming', 'transformers', 'widely']
 ```
 ## 📚 Training Dataset
+KPCrowd is a broadcast news transcription dataset consisting of 500 English broadcast news stories from 10 different categories (art and culture, business, crime, fashion, health, politics us, politics world, science, sports, technology) with 50 docs per category. This dataset is annotated by multiple annotators that were required to look at the same news story and assign a set of keyphrases from the text itself.
 You can find more information here: https://huggingface.co/datasets/midas/kpcrowd and https://github.com/LIAAD/KeywordExtractor-Datasets.
 For more information on the evaluation process, you can take a look at the keyphrase extraction evaluation notebook.
 ## 🚨 Issues
+Please feel free to start discussions in the Community Tab.