Author manuscript; available in PMC: 2020 Jan 10.
Published in final edited form as: Pac Symp Biocomput. 2020;25:647–658.

Table 1.

Querying language examples

operation                 example
view available features   df.describe()
filtering                 df.filter(feature A > 10)
grouping                  df.groupby(feature B)
aggregation               df.groupby(feature B).aggregate(feature C).summarize(max, min, mean, median, sd)
correlation               df.pearson_corr(feature A, feature B)
statistical tests         df.t_test(feature A, feature B)
visualization             df.hist(feature A)
combination of above      df.filter(feature A > 10).groupby(feature B).t_test(feature C, feature D)
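
To make the semantics of the operations in Table 1 concrete, the following is a minimal sketch in plain Python (standard library only) of what each query might compute over a small in-memory dataset. It is not the querying language's implementation. The records, the feature names (`feature_A` to `feature_D`), and all helper functions are hypothetical. Because the table does not state which t-test is meant, the sketch assumes Welch's t statistic and omits the p-value. The histogram is returned as bin counts rather than drawn.

```python
import math
import statistics
from collections import Counter, defaultdict

# Hypothetical toy records: feature names and values are illustrative only.
rows = [
    {"feature_A": 12, "feature_B": "x", "feature_C": 1.0, "feature_D": 2.0},
    {"feature_A": 15, "feature_B": "x", "feature_C": 2.0, "feature_D": 4.0},
    {"feature_A": 8,  "feature_B": "x", "feature_C": 3.0, "feature_D": 6.0},
    {"feature_A": 20, "feature_B": "y", "feature_C": 4.0, "feature_D": 8.0},
    {"feature_A": 11, "feature_B": "y", "feature_C": 5.0, "feature_D": 10.0},
    {"feature_A": 5,  "feature_B": "y", "feature_C": 6.0, "feature_D": 12.0},
]

def column(records, name):
    """Extract one feature as a list of values."""
    return [r[name] for r in records]

def filter_rows(records, predicate):
    """filtering: keep the records for which the predicate holds."""
    return [r for r in records if predicate(r)]

def groupby(records, key):
    """grouping: map each distinct value of `key` to its records."""
    groups = defaultdict(list)
    for r in records:
        groups[r[key]].append(r)
    return dict(groups)

def summarize(values):
    """aggregation: the five summaries named in the table (sd = sample sd)."""
    return {
        "max": max(values),
        "min": min(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "sd": statistics.stdev(values),
    }

def pearson_corr(xs, ys):
    """correlation: Pearson's r computed from centered sums."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def t_test(xs, ys):
    """statistical test: Welch's t statistic only (assumed variant, no p-value)."""
    se = math.sqrt(statistics.variance(xs) / len(xs)
                   + statistics.variance(ys) / len(ys))
    return (statistics.mean(xs) - statistics.mean(ys)) / se

def hist(values, bin_width):
    """visualization stand-in: counts per bin, keyed by the bin's lower edge."""
    return dict(Counter(int(v // bin_width) * bin_width for v in values))

# One line per row of the table.
features = sorted(rows[0])                                   # view available features
filtered = filter_rows(rows, lambda r: r["feature_A"] > 10)  # filtering
groups = groupby(rows, "feature_B")                          # grouping
summary = {k: summarize(column(g, "feature_C"))              # aggregation
           for k, g in groups.items()}
r = pearson_corr(column(rows, "feature_C"), column(rows, "feature_D"))
counts = hist(column(rows, "feature_A"), bin_width=10)
combined = {k: t_test(column(g, "feature_C"), column(g, "feature_D"))
            for k, g in groupby(filtered, "feature_B").items()}  # combination
```

In this sketch each operation returns ordinary Python data, so a chained query such as the last row of the table reduces to nested function calls. A method-chaining interface like the one shown in Table 1 could wrap the same helpers.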