Header Information

NPRP 4 - 1534 - 1 - 247
NPRP 04
Qatar University
Award Closed
01 Apr 2012
Prof. Qutaibah Malluhi
3 Year(s)
01 Mar 2016
New
Scientific Data Management in the Cloud

Project Summary
Dealing with large data is no longer the exclusive domain of big labs. Recent technological innovations have greatly increased the rate at which scientific data is collected, and have made scientific data easily accessible to small teams of scientists. However, the cost of storing, analyzing, and sharing data, as well as maintaining the needed infrastructure are too high for small labs to bear. Cloud computing comes handy as it reliefs small labs from the software and hardware maintenance as well as provides massive computing and storage capabilities as needed without the associated overheads. We propose to develop and deploy a cloud-enabled administration-free scientific data manager and collaboration environment. This will offer scientific data management and analysis tools via easy-to-use cloud-based data and analysis workspaces that will enable scientists to tackle large scientific problems with the least IT overhead. The proposed system innovatively tackles key requirements of managing scientific data including provenance and annotation management, supporting dependencies involving user actions, and similarity-based query processing. This system will facilitate collaborations among scientists while guaranteeing the right levels of security and privacy of the scientists' data.
As Qatar strives to achieve its Vision 2030 by building a knowledge based society, enabling scientific innovation becomes critical. Moreover, building research capacity in Qatar is transpiring through programs like NPRP that foster collaboration between Qatari researchers and scientists outside Qatar. Therefore, this project would clearly contribute to Qatar’s aspirations by equipping scientists with tools that enable easy management and sharing of large data sets, and facilitate collaboration among scientists. The project will target domains that are relevant to Qatar and the region. In particular, we will initially focus on case studies from the biomedical sciences field, which has been identified as one of the three strategic focus research areas for Qatar (see the Potential Applications section). This work complements and contributes to a number of important ongoing national initiatives including the Qatar Cloud Computing center, the Qatar Computing Research Institute, the Qatar Biomedical Research Institute, and the Sidra Medical and Research Center. The project does have an important educational component that includes student training, education and research. It is also expected that the project will lead to the introduction of modules in existing CS courses. Therefore, this effort contributes to building human capital in Qatar (the first stated objective of NPRP).
Bioinformatics; Cloud Computing; Parallel Processing; Data Management; High-Performance Computing
Applied research
1. Natural Sciences
1.2 Computer and Information Sciences
Information Science and Bioinformatics
Yes
No

Institution
Qatar University
Qatar
Submitting Institution
Purdue University
United States
Collaborative Institution

Personnel
Lead PI
Prof. Qutaibah Malluhi
Qatar University
Co-Lead PI
Prof. Qutaibah Malluhi
Qatar University
PI
Dr. Michael Gribskov
Purdue University
PI
Dr. Mourad Ouzzani
Hamad Bin Khalifa University
PI
Prof. Walid Aref
Purdue University

Outputs/Outcomes
Conference Paper
The Similarity-Aware Relational Intersect Database Operator
Agma Juci Machado Traina-editor-first, Caetano Traina-editor-additional, Robson Leonardo Ferreira Cordeiro-editor-additional
DOI:10.1007/978-3-319-11988-5
Conference Paper
The Similarity-aware Relational Intersect Database Operator
Wadha J. Al Marri, Qutaibah Malluhi, Mourad Ouzzani, Mingjie Tang, Walid G. Aref
DOI:10.1007/978331911988515
Journal Paper
The similarity-aware relational database set operators
WJ Al Marri, Q Malluhi, M Ouzzani, M Tang, WG Aref
ISSN:0306-4379
Journal Paper
Similarity Group-by Operators for Multi-Dimensional Relational Data
MingJie Tang, Ruby Y. Tahboub, Walid G. Aref, Mikhail J. Atallah, Qutaibah M. Malluhi, Mourad Ouzzani, Yasin N. Silva
ISSN:1041-4347
Conference Paper
Approving updates in collaborative databases
K Mershad, QM Malluhi, M Ouzzani, M Tang, WG Aref
DOI:10.1109/ic2e.2015.31
Conference Paper
Efficient Processing of Hamming-Distance-Based Similarity-Search Queries
M Tang, Y Yu, WG Aref, QM Malluhi, M Ouzzani
DOI:10.5441/002/edbt.2015.32
Journal Paper
Efficient Parallel Skyline Query Processing for High-Dimensional Data
Mingjie Tang ; Yongyang Yu; Walid G. Aref; Qutaibah M. Malluhi; Mourad Ouzzani
ISSN:10414347
Journal Paper
COACT: a query interface language for collaborative databases
Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref, Deo Prakash5
ISSN:15737578
Journal Paper
AUDIT: approving and tracking updates with dependencies in collaborative databases
Khaleel Mershad, Qutaibah M. Malluhi, Mourad Ouzzani, Mingjie Tang, Michael Gribskov, Walid G. Aref
ISSN:09268782
Conference Paper
In-Memory Distributed Matrix Computation Processing and Optimization
Yongyang Yu-author-first, Mingjie Tang-author-additional, Walid G. Aref-author-additional, Qutaibah M. Malluhi-author-additional, Mostafa M. Abbas-author-additional, Mourad Ouzzani-author-additional
DOI:10.1109/ICDE.2017.150
Conference Paper
LocationSpark
Mingjie Tang-author-first, Yongyang Yu-author-additional, Qutaibah M. Malluhi-author-additional, Mourad Ouzzani-author-additional, Walid G. Aref-author-additional
DOI:10.14778/3007263.3007310