Spaces:

shwetashweta05
/

Natural_language_processing

Sleeping

shwetashweta05 commited on Feb 2

Commit

7c371e8

verified ·

1 Parent(s): 946d14e

Update pages/NLP.py

Files changed (1) hide show

pages/NLP.py CHANGED Viewed

@@ -16,7 +16,7 @@ def show_page(page):
         st.title("Text preprocessing")
         st.markdown(
             """
-            ### Text preprocessing Text preprocessing converts raw data into a suitable format for computer models to understand and process that data. It processes all the data while preserving the actual meaning and context of human language in numbers. This preprocessing is done in multiple steps, but the number of steps can vary depending on the nature of the text and the goals you want to achieve with NLP.
             - **Tokenization**: It breaks down text into smaller units called tokens. These tokens can be words, characters, or punctuation marks. For example, the sentence “I want to learn NLP.” would be tokenized into: I, want, to, learn, NLP,..
             - **Stop Words**: Stopwords are words without meaning in the text, such as “is”, “the”, and “and”. Removing these words makes it easier to focus on meaningful words.
             - **Stemming**: Stemming strips away suffixes and reduces words to their base form. For example, “going” will be reduced to “go”.

         st.title("Text preprocessing")
         st.markdown(
             """
+            ### Text preprocessing converts raw data into a suitable format for computer models to understand and process that data. It processes all the data while preserving the actual meaning and context of human language in numbers. This preprocessing is done in multiple steps, but the number of steps can vary depending on the nature of the text and the goals you want to achieve with NLP.
             - **Tokenization**: It breaks down text into smaller units called tokens. These tokens can be words, characters, or punctuation marks. For example, the sentence “I want to learn NLP.” would be tokenized into: I, want, to, learn, NLP,..
             - **Stop Words**: Stopwords are words without meaning in the text, such as “is”, “the”, and “and”. Removing these words makes it easier to focus on meaningful words.
             - **Stemming**: Stemming strips away suffixes and reduces words to their base form. For example, “going” will be reduced to “go”.