Aug 11, 2021
--
series NLP S01 E02
Text Preprocessing
It involves further steps such as given below;
a) Text cleaning and pre-processing
b) word representation
c ) weighted words
d) word embeddings
a) Text cleaning and pre-processing
i. Tokenization
ii. stop words
iii. Capitalization
iv. Slang
v. noise removal
vi. spelling correction
vii. stemming
viii. lemmatization
b) word representation
i. N_Gram
c) Weighted words
i. Bag of words
ii. TF-IDF
d) Word Embedding
i. Word2Vec
ii. Glove
Please check this notebook for above explanation. I have prepared this notebook to clarify you about the topics.