Adarsha Regmi
Aug 11, 2021

series NLP S01 E02

Text Preprocessing

It involves further steps such as given below;

a) Text cleaning and pre-processing

b) word representation

c ) weighted words

d) word embeddings

a) Text cleaning and pre-processing

i. Tokenization

ii. stop words

iii. Capitalization

iv. Slang

v. noise removal

vi. spelling correction

vii. stemming

viii. lemmatization

b) word representation

i. N_Gram

c) Weighted words

i. Bag of words

ii. TF-IDF

d) Word Embedding

i. Word2Vec

ii. Glove

Please check this notebook for above explanation. I have prepared this notebook to clarify you about the topics.