Skip to main content
. 2020 Nov 18;8:209127–209137. doi: 10.1109/ACCESS.2020.3039168

TABLE 2. Data Cleaning Prior to Analysis.

Pre-processing step Function
Fix abbreviations Replace short words with full words
Remove irrelevant characters Remove redundant characters including links, email IDs
Fix word lengthening Removal of additional, repeated characters
Stopword removal Removal of words including “the”, “an” and customized stopwords
SpellCheck Fix wrong spelling
Punctuation removal Remove punctuations
Lemmatization Group words with similar meanings to a single item