NLP Series S01 E02

Text Preprocessing

Text preprocessing involves the following steps:

a) Text cleaning and pre-processing

b) Word representation

c) Weighted words

d) Word embeddings

a) Text cleaning and pre-processing

i. Tokenization

ii. Stop words

iii. Capitalization

iv. Slang

v. Noise removal

vi. Spelling correction

vii. Stemming

viii. Lemmatization
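As a rough illustration of a few of the cleaning steps listed above (capitalization, noise removal, tokenization, and stop words), here is a minimal sketch in plain Python. The `clean_text` function and the tiny `STOP_WORDS` set are made up for this example; a real pipeline would normally use a much longer stop-word list and a proper tokenizer from an NLP library.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "are", "and", "of", "to", "in"}

def clean_text(text):
    """Lowercase, strip noise, tokenize, and drop stop words."""
    text = text.lower()                       # capitalization: normalize case
    text = re.sub(r"<[^>]+>", " ", text)      # noise removal: HTML-like tags
    text = re.sub(r"[^a-z0-9\s]", " ", text)  # noise removal: punctuation and symbols
    tokens = text.split()                     # tokenization: split on whitespace
    return [t for t in tokens if t not in STOP_WORDS]  # stop-word removal

tokens = clean_text("<p>The cats ARE sitting in the garden!</p>")
print(tokens)  # ['cats', 'sitting', 'garden']
```

Slang handling, spelling correction, stemming, and lemmatization usually rely on dictionaries or trained models, so they are left out of this sketch.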

b) Word representation

i. N-grams
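An n-gram is a run of n consecutive tokens. The sketch below shows one simple way to build them from an already tokenized sentence; the helper name `ngrams` is chosen for this example only.

```python
def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

words = ["natural", "language", "processing", "is", "fun"]
bigrams = ngrams(words, 2)
print(bigrams)
# [('natural', 'language'), ('language', 'processing'),
#  ('processing', 'is'), ('is', 'fun')]
```

With n = 1 this gives unigrams (single words), and with n = 3 it gives trigrams.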

c) Weighted words

i. Bag of words

ii. TF-IDF
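To make the two weighting schemes above concrete, here is a small sketch in plain Python. It assumes the simplest textbook definitions: bag of words is a raw term count per document, term frequency is count divided by document length, and inverse document frequency is log(N / df). Libraries often use smoothed or normalized variants, so their numbers can differ. The toy corpus `docs` and the function names are made up for this example.

```python
import math
from collections import Counter

# Toy corpus of already tokenized documents (illustrative only).
docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
    ["the", "dog", "barked"],
]

def bag_of_words(doc):
    """Bag of words: how many times each term occurs in one document."""
    return Counter(doc)

def tf_idf(docs):
    """Per-document TF-IDF weights using tf = count / len(doc), idf = log(N / df)."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

weights = tf_idf(docs)
print(bag_of_words(docs[0]))
print(weights[0])
```

In this corpus "the" appears in every document, so its IDF is log(3 / 3) = 0 and its weight is zero, while "cat" appears in only one document and gets the highest weight in the first document.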

d) Word embeddings

i. Word2Vec

ii. GloVe
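Word2Vec and GloVe both map each word to a dense vector, and words are then typically compared by cosine similarity. The sketch below does not train either model; it only shows the comparison step, using three hand-written 3-dimensional vectors that are invented for illustration (real embeddings usually have tens to hundreds of dimensions and are learned from a corpus).

```python
import math

# Hypothetical toy vectors, NOT real Word2Vec or GloVe output.
embeddings = {
    "king":  [0.8, 0.6, 0.1],
    "queen": [0.7, 0.7, 0.1],
    "apple": [0.1, 0.2, 0.9],
}

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

sim_royal = cosine_similarity(embeddings["king"], embeddings["queen"])
sim_fruit = cosine_similarity(embeddings["king"], embeddings["apple"])
print(sim_royal > sim_fruit)  # True
```

With these toy values, "king" is closer to "queen" than to "apple", which is the kind of relationship trained embeddings are meant to capture.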

Please check this notebook for an explanation of the steps above. I prepared it to clarify these topics.
