Skip to main content
. 2021 Feb 9;7:e365. doi: 10.7717/peerj-cs.365

Figure 1. Steps for classification of DNA sequence: feature extraction, feature encoding and classification.

Figure 1

The first step is to input Raw sequences for feature extraction and feature encoding using K-merization and frequency-based tokenization techniques, respectively. Then the tokenized sequence fed to ML or DL model for further analysis. For deep learning, CNN and LSTM are used, whereas for machine learning RF is used.