20th Oct 2014

GARNet is pleased announce the publication of a new paper in Journal of Experimental Botany

Data Mining with iPlant is a report from the four-day workshop we hosted at the University of Warwick in 2013 in conjunction with the iPlant Collaborative. The report discusses the 'data deluge' facing plant science and highlights the ways in which iPlant's cyberinfrastructure can help to mitigate the Big Data problem, and bioinformatics challenges in teaching and research, in terms of providing free, open access to high performance computing resources and intuitive cyberinfrastructure.


High-throughput sequencing technologies have rapidly moved from large international sequencing centres to individual laboratory benchtops. These changes have driven the ‘data deluge’ of modern biology. Submissions of nucleotide sequences to GenBank, for example, have doubled in size every year since 1982, and individual data sets now frequently reach terabytes in size. While ‘big data’ present exciting opportunities for scientific discovery, data analysis skills are not part of the typical wet bench biologist’s experience. Knowing what to do with data, how to visualize and analyse them, make predictions, and test hypotheses are important barriers to success. Many researchers also lack adequate capacity to store and share these data, creating further bottlenecks to effective collaboration between groups and institutes. The US National Science Foundation-funded iPlant Collaborative was established in 2008 to form part of the data collection and analysis pipeline and help alleviate the bottlenecks associated with the big data challenge in plant science. Leveraging the power of high-performance computing facilities, iPlant provides free-to-use cyberinfrastructure to enable terabytes of data storage, improve analysis, and facilitate collaborations. To help train UK plant science researchers to use the iPlant platform and understand how it can be exploited to further research, GARNet organized a four-day Data mining with iPlant workshop at Warwick University in September 2013. This report provides an overview of the workshop, and highlights the power of the iPlant environment for lowering barriers to using complex bioinformatics resources, furthering discoveries in plant science research and providing a platform for education and outreach programmes.

Martin L, Cook C, Matasci N, Williams J and Bastow R (2014). Data Mining with iPlant: A meeting report from the 2013 GARNet workshop 'Data Mining with iPlant', Journal of Experimental Botany, DOI: 10.1093/jxb/eru402 can be accessed via this toll-free link