Bioinformatics has a long history of software solutions developed on multi-core computing systems for solving computational intensive problems. This option suffer from some issues solvable by shifting to Distributed Systems. In particular, the MapReduce computing paradigm, and its implementations, Hadoop and Spark, is becoming increasingly popular in the Bioinformatics field because it allows for virtual-unlimited horizontal scalability while being easy-to-use. Here we provide a qualitative evaluation of some of the most significant MapReduce bioinformatics applications. We also focus on one of these applications to show the importance of correctly engineering an application to fully exploit the potential of Distributed Systems.
|Titolo della pubblicazione ospite||Reference Module in the Life Sciences|
|Numero di pagine||0|
|Stato di pubblicazione||Published - 2017|
All Science Journal Classification (ASJC) codes