Header Information

NPRP 4 - 1534 - 1 - 247
Qatar University
Award Closed
01 Apr 2012
Prof. Qutaibah Malluhi
3 Year(s)
01 Mar 2016
Scientific Data Management in the Cloud

Project Summary
Dealing with large data is no longer the exclusive domain of big labs. Recent technological innovations have greatly increased the rate at which scientific data is collected, and have made scientific data easily accessible to small teams of scientists. However, the costs of storing, analyzing, and sharing data, as well as of maintaining the needed infrastructure, are too high for small labs to bear. Cloud computing comes in handy, as it relieves small labs of software and hardware maintenance and provides massive computing and storage capabilities as needed without the associated overheads. We propose to develop and deploy a cloud-enabled, administration-free scientific data manager and collaboration environment. This will offer scientific data management and analysis tools via easy-to-use cloud-based data and analysis workspaces that will enable scientists to tackle large scientific problems with the least IT overhead. The proposed system innovatively tackles key requirements of managing scientific data, including provenance and annotation management, supporting dependencies involving user actions, and similarity-based query processing. This system will facilitate collaborations among scientists while guaranteeing the right levels of security and privacy of the scientists' data.
As Qatar strives to achieve its Vision 2030 by building a knowledge-based society, enabling scientific innovation becomes critical. Moreover, research capacity in Qatar is being built through programs like NPRP that foster collaboration between Qatari researchers and scientists outside Qatar. This project would therefore contribute to Qatar’s aspirations by equipping scientists with tools that enable easy management and sharing of large data sets, and that facilitate collaboration among scientists. The project will target domains that are relevant to Qatar and the region. In particular, we will initially focus on case studies from the biomedical sciences field, which has been identified as one of the three strategic focus research areas for Qatar (see the Potential Applications section). This work complements and contributes to a number of important ongoing national initiatives including the Qatar Cloud Computing center, the Qatar Computing Research Institute, the Qatar Biomedical Research Institute, and the Sidra Medical and Research Center. The project also has an important educational component that includes student training, education, and research. It is also expected that the project will lead to the introduction of modules in existing CS courses. Therefore, this effort contributes to building human capital in Qatar (the first stated objective of NPRP).
Bioinformatics; Cloud Computing; Parallel Processing; Data Management; High-Performance Computing
Applied research
1. Natural Sciences
1.2 Computer and Information Sciences
Information Science and Bioinformatics

Qatar University
Submitting Institution
Purdue University
United States
Collaborative Institution

Lead PI
Prof. Qutaibah Malluhi
Qatar University
Co-Lead PI
Prof. Qutaibah Malluhi
Qatar University
Dr. Michael Gribskov
Purdue University
Dr. Mourad Ouzzani
Hamad Bin Khalifa University
Prof. Walid Aref
Purdue University

Conference Paper
The Similarity-Aware Relational Intersect Database Operator
Agma Juci Machado Traina, Caetano Traina, Robson Leonardo Ferreira Cordeiro (editors)
Conference Paper
The Similarity-aware Relational Intersect Database Operator
Wadha J. Al Marri, Qutaibah Malluhi, Mourad Ouzzani, Mingjie Tang, Walid G. Aref
Journal Paper
The similarity-aware relational database set operators
WJ Al Marri, Q Malluhi, M Ouzzani, M Tang, WG Aref
Journal Paper
Similarity Group-by Operators for Multi-Dimensional Relational Data
MingJie Tang, Ruby Y. Tahboub, Walid G. Aref, Mikhail J. Atallah, Qutaibah M. Malluhi, Mourad Ouzzani, Yasin N. Silva
Conference Paper
Approving updates in collaborative databases
K Mershad, QM Malluhi, M Ouzzani, M Tang, WG Aref
Conference Paper
Efficient Processing of Hamming-Distance-Based Similarity-Search Queries
M Tang, Y Yu, WG Aref, QM Malluhi, M Ouzzani
Journal Paper
Efficient Parallel Skyline Query Processing for High-Dimensional Data
Mingjie Tang, Yongyang Yu, Walid G. Aref, Qutaibah M. Malluhi, Mourad Ouzzani
Journal Paper
COACT: a query interface language for collaborative databases
Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref, Deo Prakash
Journal Paper
AUDIT: approving and tracking updates with dependencies in collaborative databases
Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref
Conference Paper
In-Memory Distributed Matrix Computation Processing and Optimization
Yongyang Yu, Mingjie Tang, Walid G. Aref, Qutaibah M. Malluhi, Mostafa M. Abbas, Mourad Ouzzani
Conference Paper
Mingjie Tang, Yongyang Yu, Qutaibah M. Malluhi, Mourad Ouzzani, Walid G. Aref