文摘
Extracting about twenty key parameters, which could lead to significant performance benefits, from the YARN and MapReduce parameter space, and classifing them into four groups: CPU relevant, memory relevant, disk relevant, and network relevant parameters. Emphasizing the significance of the application characteristics associated with the near-optimal configuration and exploring the unique characteristics of applications in the domain of bioinformatics. Documenting an exemplary case for tuning MapReduce-based bioinformatics applications in the cloud.