Skip to main content
. 2022 Nov 7;13:6717. doi: 10.1038/s41467-022-34305-6

Fig. 9. Architecture of the used Neural Network.

Fig. 9

For both regression and classification, the network first consists of three stacked long short-term memory (LSTM) layers98 of sizes 128, 128 and 64. For regression, the last LSTM is directly fully connected into the output layer returning a mean μ and variance σ, while for classification the output layer is preceded by another fully connected layer of size 20. The architecture is inspired by the successful applications of recurrent neural networks during the AnDi-Challenge62,68,71,74.