| cINN | Output size | 
| Learnable downsampling | |
| Level 1 conditional section | |
| Learnable downsampling | |
| Level 2 conditional section | |
| Flatten | 784 | 
| Split: 656 to output | 128 | 
| Level 3 dense-conditional section | 128 | 
| cINN | Output size | 
| Learnable downsampling | |
| Level 1 conditional section | |
| Learnable downsampling | |
| Level 2 conditional section | |
| Flatten | 784 | 
| Split: 656 to output | 128 | 
| Level 3 dense-conditional section | 128 |