Big Data has become an increasingly large presence in the life science R&D world, but as I have blogged about previously, increasingly larger datasets and better machine algorithms alone, will not leverage that data into bankable knowledge and can lead to erroneous inferences. My Amber Biology colleague, Gordon Webster has a great post over on LinkedIn leavening the hype around Big Data, pointing out that analytics and visualizations alone are insufficient for making progress in extracting knowledge from biological datasets:
Applying the standard pantheon of data analytics and data visualization techniques to large biological datasets, and expecting to draw some meaningful biological insight from this approach, is like expecting to learn about the life of an Egyptian pharaoh by excavating his tomb with a bulldozer
“-omics” such as those produced by transcriptomic and proteomic analyses are ultimately generated by dynamic processes consisting of individual genes, proteins and other molecules…
View original post 130 more words