Pulse

By GCN Staff

Blog archive

Not all clouds created equal

A major bottleneck in scientific discovery is now emerging because the amount of data available is outpacing local computing capacity, according to authors of new paper published on PLOSone.

And though cloud computing gives researchers a way to match capacity and power with demand, the authors wondered which cloud configuration would best met their needs.  According to the paper, Benchmarking undedicated cloud computing providers for analysis of genomic datasets, the authors benchmarked two cloud services, Amazon Web Services Elastic MapReduce (EMR) on Amazon EC2 instances and Google Compute Engine (GCE), using publicly available genomic data sets and a standard bioinformatic pipeline on a Hadoop-based platform.

They found that GCE outperformed EMR both in terms of cost and wall-clock time, though EMR was more consistent, which is an important issue in undedicated cloud computing, they wrote.

The time differences, the authors said, “could be attributed to the hardware used by the Google and Amazon for their cloud services. Amazon offers a 2.0 GHz Intel Xeon Sandy Bridge CPU, whilst Google uses a 2.6 GHz Intel Xeon Sandy Bridge CPU. This clock speed variability is considered the main contributing factor to the difference between the two undedicated platforms,” they wrote.

The authors did note that while cloud computing is an “efficient and potentially cost-effective alternative for analysis of large genomic data sets,” the initial transfer of the data into the cloud was still a challenge. One option, they suggested, would be for the data providers to directly deposit the information to a designated cloud service provider, thereby eliminating the need for the researcher to handle the data twice.

More detail about the benchmarking and results are available on PLOSone

Posted by GCN Staff on Oct 01, 2014 at 1:28 PM


Featured

  • FCW Perspectives
    human machine interface

    Your agency isn’t ready for AI

    To truly take advantage, government must retool both its data and its infrastructure.

  • Cybersecurity
    secure network (bluebay/Shutterstock.com)

    Federal CISO floats potential for new supply chain regs

    The federal government's top IT security chief and canvassed industry for feedback on how to shape new rules of the road for federal acquisition and procurement.

  • People
    DHS Secretary Kirstjen Nielsen, shown here at her Nov. 8, 2017, confirmation hearing. DHS Photo by Jetta Disco

    DHS chief Nielsen resigns

    Kirstjen Nielsen, the first Homeland Security secretary with a background in cybersecurity, is being replaced on an acting basis by the Customs and Border Protection chief. Her last day is April 10.

Stay Connected

Sign up for our newsletter.

I agree to this site's Privacy Policy.