云計算是一種通過Internet以服務(wù)的方式提供動態(tài)可伸縮的虛擬化的資源的計算模式?,F(xiàn)今,,隨著高通量測序技術(shù)的迅猛發(fā)展,生物信息學(xué)進入到大數(shù)據(jù)時代,,所引發(fā)的多組學(xué)海量生物數(shù)據(jù)的存儲和分析等問題亟待需要利用云的方式來解決,。
近期,中國科學(xué)院北京基因組研究所基因組科學(xué)與信息重點實驗室的“百人計劃”章張研究員,,與沙特阿卜杜拉國王科技大學(xué)(King Abdullah University of Science and Technology),、北京理工大學(xué)、IBM中國系統(tǒng)與科技研發(fā)中心開展合作研究,,在Biology Direct雜志上發(fā)表了題為Bioinformatics clouds for big data manipulation的學(xué)術(shù)論文,。文中分析了現(xiàn)有生物信息學(xué)領(lǐng)域的云計算服務(wù)(簡稱:生物信息云),根據(jù)其服務(wù)特點首次提出分類方法:數(shù)據(jù)即服務(wù)(DaaS,Data as a Service),、軟件即服務(wù)(SaaS,Software as a Service),、平臺即服務(wù)(PaaS,,Platform as a Service)以及基礎(chǔ)設(shè)施即服務(wù)(IaaS,Infrastructure as a Service),。
生物信息云從四個方面提供了海量生物數(shù)據(jù)的儲存,、獲取、分析等相關(guān)需求的服務(wù),。同時,,文中對云計算在生物信息學(xué)的應(yīng)用進行了展望和討論,提出并分析了以下幾個亟需解決問題,,即生物信息云應(yīng)實現(xiàn)數(shù)據(jù)和軟件的云儲存,,結(jié)合最新的高速傳輸、P2P,、數(shù)據(jù)壓縮等技術(shù)支持大數(shù)據(jù)的傳輸,,開發(fā)基于云的輕量型編程環(huán)境,以及建立開放的生物信息學(xué)云平臺,。(生物谷Bioon.com)
doi:10.1186/1745-6150-7-43
PMC:
PMID:
Bioinformatics clouds for big data manipulation
Lin Dai, Xin Gao, Yan Guo, Jingfa Xiao and Zhang Zhang
As advances in life sciences and information technology bring profound influences on bioinformatics due to its interdisciplinary nature, bioinformatics is experiencing a new leap-forward from in-house computing infrastructure into utility-supplied cloud computing delivered over the Internet, in order to handle the vast quantities of biological data generated by high-throughput experimental technologies. Albeit relatively new, cloud computing promises to address big data storage and analysis issues in the bioinformatics field. Here we review extant cloud-based services in bioinformatics, classify them into Data as a Service (DaaS), Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS), and present our perspectives on the adoption of cloud computing in bioinformatics.