QCRI’s Data Analytics department's research expertise transcends the data science stack - from data extraction and cleaning to analytics, retrieval, inference and prediction. The group expresses its research by publishing in the world’s leading computer science conferences and journals, and through engineering prototype software systems that are being used by researchers and practitioners worldwide. More specifically the group is working on the following projects:
- RHEEM is the world’s first cross-platform system which seamlessly decouples data science application tasks from their underlying execution platform. Users specify data analytical tasks, and the RHEEM platform will automatically decide on which combination of platforms to execute the tasks on. The RHEEM platform includes ML4ALL, a machine learning suite that is specifically designed for big data.
- The Data Civilizer project, in partnership with MIT, is carrying out research to resolve a core bottleneck facing all data-rich organizations – how to automatically extract knowledge from thousands of heterogeneous data sources that are scattered throughout an enterprise. The technology developed as part of a project is being used by a large pharmaceutical company for data integration and analysis.
- The Kharita map inference algorithm infers road networks from noisy GPS traces in real time. Kharita is being used to automatically update maps in environments witnessing rapid changes and where existing map and navigation applications quickly become out-of-sync with the ground realities.
- Rayyan is a sophisticated and fast-growing text retrieval system for carrying out systematic reviews in the medical domain. Researchers use Rayyan to efficiently evaluate competing claims regarding a medical hypothesis under investigation.
The Data Analytics department has also established deep research partnerships with The Boeing Company and Qatar Airways in the areas of predictive maintenance and cargo prediction.