"Class‐paired Fuzzy SubNETs: A paired variant of the rank‐based network analysis family for feature selection based on protein complexes"

Identifying reproducible yet relevant protein features in proteomics data is a major challenge. Analysis at the level of protein complexes can resolve this issue and we have developed a suite of feature‐selection methods collectively referred to as Rank‐Based Network Analysis (RBNA). RBNAs differ in their individual statistical test setup but are similar in the sense that they deploy rank‐defined weights among proteins per sample. This procedure is known as gene fuzzy scoring. Currently, no RBNA exists for paired‐sample scenarios where both control and test tissues originate from the same source (e.g. same patient). It is expected that paired tests, when used appropriately, are more powerful than approaches intended for unpaired samples. We report that the class‐paired RBNA, PPFSNET, dominates in both simulated and real data scenarios. Moreover, for the first time, we explicitly incorporate batch‐effect resistance as an additional evaluation criterion for feature‐selection approaches. Batch effects are class irrelevant variations arising from different handlers or processing times, and can obfuscate analysis. We demonstrate that PPFSNET and an earlier RBNA, PFSNET, are particularly resistant against batch effects, and only select features strongly correlated with class but not batch.

Keywords: class; analysis; protein; feature selection

Journal Title: PROTEOMICS
Year Published: 2017

Link to full text (if available)

Share on Social Media: Sign Up to like & get
recommendations!
1

LAUSR

You are not signed in:

Sign Up!

Related content

More Information News Social Media Video Recommended