#### **Supplementary Figure 1.**

**Effect of dielectric constant (*k*) on the current flow across the blocking or tunneling dielectric layer (BDL or TDL).** Current density (*J*<sub>i</sub>) vs. applied voltage (*V*<sub>i</sub>) of metal/insulator/metal (MIM) devices with 'I' being **a)** 48 nm-thick pEGDMA and **b)** 8.5 nm-thick pV3D3. **c)** The expected values of *J*<sub>i</sub> for BDL and TDL vs. $V_{CG}$ of an M/BDL/M/TDL/M device wherein M is Al, BDL is the 48 nm-thick pEGDMA, and TDL is the 8.5 nm-thick pV3D3. *V*<sub>CG</sub> was calculated using the coupling ratio ($\alpha_{CR}$) for each case of BDL and TDL. For a more detailed description of the coupling ratio, refer to Supplementary Note 1.



#### **Supplementary Figure 2.**

**Cycling endurance characteristics of flexible organic flash memory fabricated on a PET substrate.** Memory characteristics vs. the number of repetitive programming/erasing operations up to 1,000 times: **a**) transfer memory curves and **b**) threshold voltage ($V_{\text{th}}$) distribution. **c**) Measurement procedure for the cycling endurance test by repetitive electrical stress (*τ*<sub>P/E</sub> = 1 s). Data are color-coded: 'red' indicates data or operations related to erasing and 'blue' indicates those related to programming. The gate leakage current after the 1,000 cycles is also shown in **a**) (dotted line).



#### **Supplementary Figure 3.**

**Memory characteristics vs. flexural tensile strain.** **a)** Transfer memory curves of a flexible organic flash memory on a 250 µm-thick PET film. Gate leakage current ($I_G$) after 2.8 % strain is also shown (dotted line). **b**) $J_i$-$E_i$ characteristics of a pEGDMA-based MIM device obtained under tensile strain of 0.6 % to 4.0 %. **c)-j)** Photographs of the flexible organic device on a 250 µm-thick PET film under various levels of strain.



#### **Supplementary Figure 4.**

**Characteristics of the organic flash memory fabricated on a 100 µm-thick PET substrate after repeated bending cycles.** **a)** Transfer curves showing consistent operation over various bending cycles at a flexural strain of 1.1 %. The low gate leakage current $(I_G)$ after 10,000 bending cycles is shown as a dotted line. **b**) Threshold voltage $(V_{th})$ distribution. **c**) Photograph of the flexible organic flash memory fabricated on a 100 µm-thick PET film under the bending test.



#### **Supplementary Figure 5.**

**Flexible C<sub>60</sub>-based thin-film transistor (TFT) characteristics: TFT performance vs. flexural strain.** **a)** OTFT structure with pV3D3 and pEGDMA used to test the flexibility of TFT devices. **b)** Transfer curves of a flexible OTFT on a 250 µm-thick PET film measured for flexural strain of 0 % to 4.0 %. **c)-f)** Device characteristics of the C<sub>60</sub>-based TFT obtained for various strain values up to 4.0 %: **c)** gate leakage current ($I_G$), **d)** hysteresis (= $|V_{G,\text{forward}} \text{ @ } I_{D} = 100 \text{ nA} - V_{G,\text{backward}} \text{ @ } I_{D} = 100 \text{ nA}|$), **e)** threshold voltage (*V*<sub>th</sub>), and **f)** saturation mobility degradation ($\mu_{\text{sat}}/\mu_{\text{sat,0}}$). Degradation is not significant until a strain of 3.3 % is applied.



#### **Supplementary Figure 6.**

**Custom-built test setup for characterization of foldable memory devices.** The testing assembly is based on a test fixture and a slider mounted on a computer-controlled motorized translator, whose speed can be varied from a few $\mu$m s<sup>-1</sup> to a few cm s<sup>-1</sup>. The test fixture consists of a sample loader, a rigid piano wire, and two wire presses, one of which contains a guitar-type tuner mechanism for proper tension management. To minimize handling issues with ultrathin devices on Mylar<sup>TM</sup> substrates, the foldability test is carefully carried out as follows: **a**) setting up the components of the test fixture, **b**) sliding one end of a sample underneath the wire and attaching that end to the loader using PET tape, **c)** folding the sample over the wire and attaching the opposite end of the sample to the slider via a long, extended Kapton tape. The slider is then translated back and forth to fold or unfold a device against the piano wire. (See Supplementary Videos 1 and 2 for the folding test in action.) Example pictures of half-released and fully folded states are provided as insets. **d)** Photograph of the test assembly integrated with a manual probe station. **e)** The enlarged version of Fig. 3e in the main text.



#### **Supplementary Figure 7.**

**Comparison of the present work with the TFT-based non-volatile memory devices reported in the literature.** The graph classifies the devices according to the kind of BDL/TDL layers used: inorganic or organic. Points labeled with '$\varepsilon$' values correspond to the performance data of flexible memory devices. In those cases, '$\varepsilon$' refers to the maximum strain tested in the corresponding work. The points in the figure are generated using the data in Supplementary Table 2, and the numbers in square brackets refer to reference numbers in the main text.



#### **Supplementary Figure 8.**

**Effect of planarizing paper with iCVD-grown polymers for disposable organic flash memories.** Scanning electron microscope (SEM) images obtained for the top surfaces of the following: **a)** a pristine dye-sublimation paper (DP), **b)** a 1.5 µm-thick poly(divinylbenzene) (pDVB) layer on the pristine DP, **c**) DP/ Al/ pEGDMA/ Al/ pV3D3/ C<sub>60</sub>, and **d**) DP/ pDVB (1.5 µm)/ Al/ pEGDMA/ Al/ pV3D3/ C<sub>60</sub>.



#### **Supplementary Table 1.**

| <b>State</b>               | Layer              | $V_{\text{layer}}^{(\text{total})}$ (=the total voltage across a given layer for $V_{\text{CG}}$ )                                                  |
|----------------------------|--------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------|
| initial                    | BDL (pEGDMA)       | $V_{\text{bi},\text{pEGDMA}}$                                                                                                                       |
|                            | TDL (pV3D3)        | $V_{\text{bi},\text{pV3D3}}$                                                                                                                        |
|                            | Channel $(C_{60})$ | $\sim 0$                                                                                                                                            |
| Programmed<br>$(P_0)$      | BDL (pEGDMA)       | $V_{\text{bi},\text{pEGDMA}} + \Delta V_{\text{th},\text{pEGDMA}}^{\text{P}_0}$                                                                     |
|                            | TDL (pV3D3)        | $V_{\text{bi},\text{pV3D3}} + \Delta V_{\text{th},\text{pV3D3}}^{\text{P}_0}$                                                                       |
|                            | Channel $(C_{60})$ | $\Delta V_{\text{th}}^{\text{P}_0}$                                                                                                                 |
| Erased<br>$(E_0)$          | BDL (pEGDMA)       | $V_{\text{bi},\text{pEGDMA}} + \Delta V_{\text{th},\text{pEGDMA}}^{\text{E}_0}$                                                                     |
|                            | TDL (pV3D3)        | $V_{\text{bi},\text{pV3D3}} + \Delta V_{\text{th},\text{pV3D3}}^{\text{E}_0}$                                                                       |
|                            | Channel $(C_{60})$ | $\sim 0$                                                                                                                                            |
| Being programmed<br>(Prg.) | BDL (pEGDMA)       | $(V_{\text{bi},\text{pEGDMA}} + \Delta V_{\text{th},\text{pEGDMA}}^{\text{E}_0}) + (1 - \alpha_{\text{CR}}^{\text{prg}})V_{\text{CG}}^{\text{prg}}$ |
|                            | TDL (pV3D3)        | $(V_{\text{bi},\text{pV3D3}} + \Delta V_{\text{th},\text{pV3D3}}^{\text{E}_0}) + \alpha_{\text{CR}}^{\text{prg}}V_{\text{CG}}^{\text{prg}}$         |
|                            | Channel $(C_{60})$ | $\sim 0$                                                                                                                                            |
| Being erased<br>(Ers.)     | BDL (pEGDMA)       | $(V_{\text{bi},\text{pEGDMA}} + \Delta V_{\text{th},\text{pEGDMA}}^{\text{P}_0}) + (1 - \alpha_{\text{CR}}^{\text{ers}})V_{\text{CG}}^{\text{ers}}$ |
|                            | TDL (pV3D3)        | $(V_{\text{bi},\text{pV3D3}} + \Delta V_{\text{th},\text{pV3D3}}^{\text{P}_0}) + \alpha_{\text{CR}}^{\text{ers}} V_{\text{CG}}^{\text{ers}} \beta$  |
|                            | Channel $(C_{60})$ | $\Delta V_{\text{th},\text{C}_{60}}^{\text{P}_0} + \alpha_{\text{CR}}^{\text{ers}} V_{\text{CG}}^{\text{ers}}(1-\beta)$                             |

The net voltage across a layer of interest for a given  $V_{CG}$  in the proposed memory vs. operation conditions.

The table shows the total voltage across a given layer of interest $(V_{\text{layer}}^{(\text{total})})$ expected in the proposed memory biased at $V_{\text{CG}}$ for each of the device states: initial, programmed ($\mathbf{P}_0$; $V_{CG} = 0$), erased ($\mathbf{E}_0$; $V_{CG} = 0$), being programmed ($\mathbf{Prg}$; $V_{CG} = V_{CG}^{prg}$), or being erased ($\mathbf{Ers}$; $V_{CG} = V_{CG}^{ers}$). $V_{\text{layer}}^{(\text{total})}$ reflects (i) the effect of built-in voltages ($V_{\text{bi},\text{layer}}$) owing to the initial Fermi level alignment in thermal equilibrium; (ii) the effect of the stored or erased charges, which appears as a threshold voltage shift ($\Delta V_{\text{th},\text{layer}}^{\text{P}_0}$ or $\Delta V_{\text{th},\text{layer}}^{\text{E}_0}$); and (iii) that of the applied bias influenced by the coupling ratio, $\alpha_{\text{CR}}^{\text{prg}}$ or $\alpha_{\text{CR}}^{\text{ers}}$. Note that, during the erasing operation, the C<sub>60</sub> channel is set to be off (i.e., insulating), and thus $V_{\text{a,FG}}$ is not solely applied to TDL (pV3D3) but is divided between pV3D3 and C<sub>60</sub> ($V_{\text{a,FG}} = V_{\text{a,pV3D3}} + V_{\text{a,C}_{60}}$), with the fraction dropped across pV3D3 (= $\beta$) given by:

$$
\beta = \frac{V_{\text{a,pV3D3}}}{V_{\text{a,FG}}} = \frac{d_{\text{pV3D3}}k_{\text{C}_{60}}}{d_{\text{C}_{60}}k_{\text{pV3D3}} + d_{\text{pV3D3}}k_{\text{C}_{60}}} \qquad \dots \qquad (1)
$$
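
As a rough numerical illustration of Eq. (1), the sketch below treats pV3D3 and the insulating C<sub>60</sub> channel as two capacitors in series and computes the share of the applied voltage that falls on each. Only the 8.5 nm pV3D3 thickness comes from this document; the dielectric constants and the C<sub>60</sub> thickness are placeholder assumptions chosen for illustration, and the function name is hypothetical.

```python
def tdl_voltage_fraction(d_tdl_nm, k_tdl, d_channel_nm, k_channel):
    """Fraction (beta) of the voltage applied at FG that drops across the TDL
    when the TDL and an insulating channel act as capacitors in series."""
    return (d_tdl_nm * k_channel) / (d_channel_nm * k_tdl + d_tdl_nm * k_channel)


# Thickness of pV3D3 is taken from the figure captions.
D_PV3D3_NM = 8.5
# Everything below is an ASSUMED placeholder, not a value from this document.
K_PV3D3 = 2.2
D_C60_NM = 50.0
K_C60 = 4.0

beta = tdl_voltage_fraction(D_PV3D3_NM, K_PV3D3, D_C60_NM, K_C60)

v_applied_fg = 1.0                       # normalized voltage applied at FG
v_pv3d3 = beta * v_applied_fg            # share across the TDL
v_c60 = (1.0 - beta) * v_applied_fg      # share across the channel

print(f"beta = {beta:.3f}")
print(f"V_a,pV3D3 = {v_pv3d3:.3f}, V_a,C60 = {v_c60:.3f}")
```

With these definitions the two shares always add up to the applied voltage, which is the property used in the "being erased" rows of the table.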

It should also be noted that $\alpha_{CR}^{\text{ers}}$ differs from $\alpha_{CR}^{\text{prg}}$ because the effective area and thickness used to define the capacitance (*C*) across FG/TDL/C<sub>60</sub>/(D or S) are different, as shown in Fig. 2c and 2d in the main text. They are given by the following equations:

$$
C = \varepsilon_0 \frac{k_{\text{material}} \times A_{\text{effective-area}}}{d_{\text{effective-thickness}}} \qquad \dots \qquad (2)
$$

$$
\alpha_{\text{CR}}^{\text{prg}} = \frac{C_{\text{BDL}}^{\text{prg}}}{C_{\text{BDL}}^{\text{prg}} + C_{\text{TDL}}^{\text{prg}}} = \frac{C_{\text{pEGDMA}}^{\text{prg}}}{C_{\text{pEGDMA}}^{\text{prg}} + C_{\text{pV3D3}}^{\text{prg}}} \qquad \dots \qquad (3)
$$

$$
\alpha_{\text{CR}}^{\text{ers}} = \frac{C_{\text{BDL}}^{\text{ers}}}{C_{\text{BDL}}^{\text{ers}} + C_{\text{TDL}}^{\text{ers}} || C_{\text{channel}}^{\text{ers}}} = \frac{C_{\text{pEGDMA}}^{\text{ers}}}{C_{\text{pEGDMA}}^{\text{ers}} + C_{\text{pV3D3}}^{\text{ers}} || C_{\text{C}_{60}}^{\text{ers}}} \qquad \dots \qquad (4)
$$

$$
\text{where } C_1 || C_2 = \frac{C_1 C_2}{C_1 + C_2}
$$
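
The coupling ratios for programming and erasing can be sketched numerically as below. This is a minimal illustration, not the procedure used for the reported data: the layer thicknesses of pEGDMA (48 nm) and pV3D3 (8.5 nm) are taken from the captions, while the dielectric constants, the C<sub>60</sub> thickness, and the common area are assumed placeholders. For simplicity the same capacitances are reused for both operations, whereas in the actual device the effective area and thickness differ between programming and erasing.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m


def capacitance(k, area_m2, thickness_m):
    """Parallel-plate capacitance from dielectric constant, area, and thickness."""
    return EPS0 * k * area_m2 / thickness_m


def series(c1, c2):
    """Series combination of two capacitors, written C1 || C2 in the text."""
    return c1 * c2 / (c1 + c2)


def alpha_prg(c_bdl, c_tdl):
    """Coupling ratio while programming (channel conducting)."""
    return c_bdl / (c_bdl + c_tdl)


def alpha_ers(c_bdl, c_tdl, c_channel):
    """Coupling ratio while erasing (channel insulating, in series with the TDL)."""
    return c_bdl / (c_bdl + series(c_tdl, c_channel))


AREA_M2 = 1e-8  # ASSUMED common area for all layers
c_pegdma = capacitance(3.1, AREA_M2, 48e-9)   # k assumed; thickness from captions
c_pv3d3 = capacitance(2.2, AREA_M2, 8.5e-9)   # k assumed; thickness from captions
c_c60 = capacitance(4.0, AREA_M2, 50e-9)      # k and thickness assumed

a_prg = alpha_prg(c_pegdma, c_pv3d3)
a_ers = alpha_ers(c_pegdma, c_pv3d3, c_c60)
print(f"alpha_CR(prg) = {a_prg:.3f}")
print(f"alpha_CR(ers) = {a_ers:.3f}")
```

When the same BDL and TDL capacitances are used for both cases, the series channel capacitance lowers the FG-side capacitance and therefore raises the ratio; with operation-specific effective areas this ordering is not guaranteed.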

#### **Supplementary Table 2.**



**Summary of the structure and performance of flash-type memory devices based on organic or emerging materials**

\* "Pen" = pentacene; "Grap" = graphene; "BP" = black phosphorus; "F8T2" = Poly(9,9-dioctylfluorene-alt-bithiophene); "P3HT" = Poly(3-hexylthiophene)

† Square brackets were used to indicate the case of multi-component BDL, CSL, or TDL, where CSL refers to a charge storage layer, which could be FG, metal nano-particles (NPs), and so on. Those without values in parentheses indicate that thickness values were not reported in the corresponding papers. "HBN" refers to hexagonal boron nitride.

§ *V*<sub>op</sub> = the larger of $V_{\text{prog}}$ and $V_{\text{erase}}$

#### **Supplementary Note 1.**

**Further description of the coupling ratio.** Let us consider an S/TDL/FG/BDL/CG structure shown in Supplementary Fig. 1c, which is equivalent to the memory structure underneath the S electrode except for the absence of a semiconductor layer. Let us assume that S, FG, and CG are all based on the same kind of metal for simplicity. Under the capacitive approximation, the coupling ratio ($\alpha_{CR}$), i.e., the ratio of the voltage between S and FG across TDL (= $V_{FG}$) to $V_{CG}$, and the electric fields across TDL and BDL ($E_{TDL}$, $E_{BDL}$) are then given by:

$$
\alpha_{\rm CR} = \frac{V_{\rm FG}}{V_{\rm CG}} = \frac{1}{C_{\rm TDL}} \left(\frac{1}{C_{\rm TDL}} + \frac{1}{C_{\rm BDL}}\right)^{-1} = \frac{d_{\rm TDL}k_{\rm BDL}A_{\rm BDL}}{d_{\rm TDL}k_{\rm BDL}A_{\rm BDL} + d_{\rm BDL}k_{\rm TDL}A_{\rm TDL}} \qquad \cdots \qquad (5)
$$

$$
E_{\rm TDL} = \frac{V_{\rm TDL}}{d_{\rm TDL}} = \frac{V_{\rm FG}}{d_{\rm TDL}} = \frac{\alpha_{\rm CR}V_{\rm CG}}{d_{\rm TDL}} = \frac{k_{\rm BDL}A_{\rm BDL}}{d_{\rm TDL}k_{\rm BDL}A_{\rm BDL} + d_{\rm BDL}k_{\rm TDL}A_{\rm TDL}} V_{\rm CG} \qquad \cdots \qquad (6)
$$

$$
E_{\rm BDL} = \frac{V_{\rm BDL}}{d_{\rm BDL}} = \frac{V_{\rm CG} - V_{\rm FG}}{d_{\rm BDL}} = \frac{(1 - \alpha_{\rm CR})V_{\rm CG}}{d_{\rm BDL}} = \frac{k_{\rm TDL}A_{\rm TDL}}{d_{\rm TDL}k_{\rm BDL}A_{\rm BDL} + d_{\rm BDL}k_{\rm TDL}A_{\rm TDL}} V_{\rm CG} \qquad \cdots \qquad (7)
$$

where  $k_i$  and  $d_i$  ( $i = BDL$  or TDL) are the dielectric constant and thickness of BDL or TDL and  $C_i$  is the capacitance of a FG/TDL/S (for  $i = 'TDL'$ ) or CG/BDL/FG (for  $i = 'BDL'$ ) determined by  $k_i$ ,  $d_i$ , and the device area  $A_i$ .
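
A minimal sketch of how a given $V_{CG}$ maps to the two fields under the capacitive approximation is shown below. The thicknesses (8.5 nm pV3D3 as TDL, 48 nm pEGDMA as BDL) are from the captions; the dielectric constants, the equal areas, and the 10 V bias are assumed placeholders for illustration only, and the function names are hypothetical.

```python
def coupling_ratio(d_tdl, k_tdl, a_tdl, d_bdl, k_bdl, a_bdl):
    """alpha_CR = C_BDL / (C_BDL + C_TDL), expressed via thickness, k, and area."""
    bdl_term = d_tdl * k_bdl * a_bdl
    tdl_term = d_bdl * k_tdl * a_tdl
    return bdl_term / (bdl_term + tdl_term)


def layer_fields(v_cg, d_tdl, k_tdl, a_tdl, d_bdl, k_bdl, a_bdl):
    """Return (alpha_CR, E_TDL, E_BDL) for a control-gate voltage v_cg.
    Lengths in metres, so the fields are in V/m."""
    alpha = coupling_ratio(d_tdl, k_tdl, a_tdl, d_bdl, k_bdl, a_bdl)
    e_tdl = alpha * v_cg / d_tdl
    e_bdl = (1.0 - alpha) * v_cg / d_bdl
    return alpha, e_tdl, e_bdl


# Thicknesses from the captions; k values, areas, and bias are ASSUMED.
params = dict(d_tdl=8.5e-9, k_tdl=2.2, a_tdl=1.0,
              d_bdl=48e-9, k_bdl=3.1, a_bdl=1.0)
alpha, e_tdl, e_bdl = layer_fields(10.0, **params)

print(f"alpha_CR = {alpha:.3f}")
print(f"E_TDL = {e_tdl / 1e8:.2f} MV/cm, E_BDL = {e_bdl / 1e8:.2f} MV/cm")
print(f"E_BDL / E_TDL = {e_bdl / e_tdl:.3f}")
```

Because the voltages across the two layers are fixed fractions of $V_{CG}$, the ratio of the two fields does not depend on the bias or on the thicknesses, only on the dielectric constants and areas.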

Using Eqs. (5)–(7), one can readily determine the fields applied across BDL and TDL for a given $V_{CG}$. Comparing Eq. (6) with Eq. (7) gives $E_{BDL}/E_{TDL} = k_{TDL}A_{TDL}/(k_{BDL}A_{BDL})$; hence, having a BDL with a higher *k*-value than the TDL is very important for keeping $E_{BDL}$ lower than $E_{TDL}$. Supplementary Fig. 1c was constructed (i) by individually measuring the electrical characteristics of FG/BDL/CG (Supplementary Fig. 1a) and S/TDL/FG (Supplementary Fig. 1b) and then (ii) by mapping their applied bias to $V_{CG}$ via Eqs. (4)–(7). With the same $A_{BDL}$ and $A_{TDL}$, a higher $k_{BDL}$ than $k_{TDL}$ results in a lower $E_{BDL}$, leading to a lower $J_{BDL}$ than $J_{TDL}$ at high $V_{CG}$, as shown in Supplementary Fig. 1c.
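
For reference, the field ratio can be worked out in one step from the definitions alone, assuming parallel-plate capacitances $C_i = \varepsilon_0 k_i A_i / d_i$. Since $\alpha_{\rm CR} = C_{\rm BDL}/(C_{\rm BDL} + C_{\rm TDL})$, it follows that $(1 - \alpha_{\rm CR})/\alpha_{\rm CR} = C_{\rm TDL}/C_{\rm BDL}$, and therefore:

$$
\frac{E_{\rm BDL}}{E_{\rm TDL}} = \frac{(1 - \alpha_{\rm CR})V_{\rm CG}/d_{\rm BDL}}{\alpha_{\rm CR}V_{\rm CG}/d_{\rm TDL}} = \frac{C_{\rm TDL}}{C_{\rm BDL}} \cdot \frac{d_{\rm TDL}}{d_{\rm BDL}} = \frac{k_{\rm TDL}A_{\rm TDL}/d_{\rm TDL}}{k_{\rm BDL}A_{\rm BDL}/d_{\rm BDL}} \cdot \frac{d_{\rm TDL}}{d_{\rm BDL}} = \frac{k_{\rm TDL}A_{\rm TDL}}{k_{\rm BDL}A_{\rm BDL}}
$$

The thicknesses and $V_{\rm CG}$ cancel, so for equal areas the ratio reduces to $k_{\rm TDL}/k_{\rm BDL}$.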