2019 Feb 20;21(2):e12783. doi: 10.2196/12783

Table 2.

Samples of the training data corpus for the English subtask.

| Tweet ID | Message | s1 [a] | s2 | s3 | s4 | s5 | s6 | s7 | s8 |
|---|---|---|---|---|---|---|---|---|---|
| 1en [b] | The cold makes my whole body weak. | p [c] | n [d] | n | n | n | n | n | n |
| 2en | It’s been a while since I’ve had allergy symptoms. | n | n | n | n | p | n | n | p |
| 3en | I’m so feverish and out of it because of my allergies. I’m so sleepy. | n | n | n | p | p | n | n | p |
| 4en | I took some medicine for my runny nose, but it won’t stop. | n | n | n | n | n | n | n | p |
| 5en | I had a bad case of diarrhea when I traveled to Nepal. | n | n | n | n | n | n | n | n |
| 6en | It takes a millennial wimp to call in sick just because they’re coughing. It’s always important to go to work, no matter what. | n | p | n | n | n | n | n | n |
| 7en | I’m not going today, because my stuffy nose is killing me. | n | n | n | n | n | n | n | p |
| 8en | I never thought I would have allergies. | n | n | n | n | p | n | n | p |
| 9en | I have a fever but I don’t think it’s the kind of cold that will make it to my stomach. | p | n | n | p | n | n | n | n |
| 10en | My phlegm has blood in it and it’s really gross. | n | p | n | n | n | n | n | n |

[a] s1, s2, s3, s4, s5, s6, s7, and s8 are the IDs of the 8 symptoms (cold, cough, diarrhea, fever, hay fever, headache, flu, and runny nose).

[b] The ID corresponds to the corpora of other languages (eg, the tweet 1en corresponds to the tweets 1ja and 1zh).

[c] p indicates the positive label.

[d] n indicates the negative label.
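As an illustration of how one row of this table can be read, the following minimal Python sketch maps the eight p/n labels of a tweet onto the symptom names listed in the first footnote. The names `SYMPTOMS` and `decode_labels` are hypothetical and are not part of the original corpus or its tooling; the sketch assumes only the column order s1 to s8 given above.

```python
# Symptom names in column order s1..s8, as listed in the table footnote.
SYMPTOMS = [
    "cold", "cough", "diarrhea", "fever",
    "hay fever", "headache", "flu", "runny nose",
]


def decode_labels(labels):
    """Return the names of the symptoms labeled positive ('p') for one tweet.

    `labels` is a sequence of 8 strings, each 'p' (positive) or 'n' (negative).
    This helper is a hypothetical illustration, not the corpus authors' code.
    """
    if len(labels) != len(SYMPTOMS):
        raise ValueError(f"expected {len(SYMPTOMS)} labels, got {len(labels)}")
    return [name for name, label in zip(SYMPTOMS, labels) if label == "p"]


# Labels for tweet 3en, copied from the table.
row_3en = "n n n p p n n p".split()
print(decode_labels(row_3en))  # ['fever', 'hay fever', 'runny nose']
```

A tweet can carry several positive labels at once (3en) or none at all (5en), so the task is multi-label rather than single-class.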