. 2015 Jul 31;9(6):627–638. doi: 10.1007/s11571-015-9350-4

Table 2.

The GRSOMO and GRSOMU algorithms

(i) Let

T

be the original unbalanced training set with

n + m

samples, where

n > m

. And let

T = P \cup N

, where

P

contains only the data in the minority class with

m

samples and

N

contains only data in the majority class with

n

samples

(ii) IF GRSOMO is desired THEN DO steps (iii)–(v)

(iii) Use GRSOM to generate new data from

P

for

n - m

samples such that

P

is used as an input of the GRSOM function, i.e. GRSOM(

X

t_{max}

) where

X \Leftarrow P

. Then, the function will return the

n - m

samples which are contained in the new grown data set

X^{+}

, i.e.

X^{+} \Leftarrow

GRSOM(

X

t_{max}

). Note that, in the case of over-sampling approach,

t_{max} \Leftarrow n - m - N (t = 0)

(iv)

P^{+} \Leftarrow X^{+}

and

P^{+ +} \Leftarrow P \cup P^{+}

. Thus, the number of samples in the minority class can be adjusted from

m

m + (n - m) = n

samples which equals to the number of samples in the majority class

(v) Define the balanced training set as

T^{+} \Leftarrow P^{+ +} \cup N

and GO TO ix)

(vi) IF GRSOMU is desired THEN DO steps vii) to viii)

(vii) Use GRSOM to generate new data from

N

for

m

samples in which

N

is used as an input of the GRSOM function, i.e. GRSOM(

X

) where

X \Leftarrow N

. Then, the function will return the new grown data set

X^{+}

with

m

samples, i.e.

t_{max} \Leftarrow m - N (t = 0)

X^{+} \Leftarrow

GRSOM(

X

t_{max}

)

(viii)

N^{+} \Leftarrow X^{+}

and

T^{+} \Leftarrow N^{+} \cup P

. Note that only the

m

samples will be generated for the majority class which equals to original number of samples in the minority class. This leads to the balanced training set

T^{+}

(ix) Return

T^{+}

(x) END