Foundations and Trends® in Databases > Vol 15 > Issue 1

Robust Query Processing: A Survey

By Jayant R. Haritsa, Indian Institute of Science, India, haritsa@iisc.ac.in

 
Suggested Citation
Jayant R. Haritsa (2024), "Robust Query Processing: A Survey", Foundations and TrendsĀ® in Databases: Vol. 15: No. 1, pp 1-114. http://dx.doi.org/10.1561/1900000089

Publication Date: 17 Dec 2024
© 2024 J. R. Haritsa
 
Subjects
Query processing and optimization,  Adaptive query processing,  Storage, access methods, and indexing,  Database design and tuning,  Data warehousing,  Robustness,  Deep learning
 

Free Preview:

Download extract

Share

Download article
In this article:
1. Introduction
2. Background to Robust Query Processing
3. Robust Operators
4. Robust Plans
5. Robust Query Execution
6. Structural Robustness Bounds
7. Robust Cost Models
8. Machine Learning-based Techniques
9. Holistic Robustness
10. Future Research Directions
11. Additional Reading
Acknowledgements
References

Abstract

The primordial function of a database system is to efficiently compute correct answers to user queries. Therefore, robust query processing (RQP), where strong numerical guarantees are provided on query performance, has been a long-standing core objective in the design of industrial-strength database engines. Unfortunately, however, RQP has proved to be a largely intractable and elusive challenge, despite sustained efforts spanning several decades. This problematic situation has arisen from a variety of knotty technical hurdles, including complex query representations, limited metadata coverage, coarse statistical models, and hypersensitive operator behaviors. Its impact is felt acutely since the performance degradation faced by database queries can be huge, reaching orders of magnitude as compared to an oracular ideal.

Notwithstanding this daunting history, the good news is that in recent times, there have been a host of exciting technical advances that collectively promise to materially address the robustness objective. The new approaches have been constructed at different levels in the database architecture, and tackle robustness in cost models, database operators, query execution plans and query processing strategies. Although most of this literature is based on statistical and geometric formulations, a significant corpus of machine learning-based techniques is also now available.

In this monograph, we present an overview of these novel research paradigms, and highlight their strengths and limitations. Further, we enumerate a suite of open technical problems that remain to be solved to make RQP a contemporary reality.

DOI:10.1561/1900000089
ISBN: 978-1-63828-426-0
134 pp. $90.00
Buy book (pb)
 
ISBN: 978-1-63828-427-7
134 pp. $155.00
Buy E-book (.pdf)
Table of contents:
1. Introduction
2. Background to Robust Query Processing
3. Robust Operators
4. Robust Plans
5. Robust Query Execution
6. Structural Robustness Bounds
7. Robust Cost Models
8. Machine Learning-based Techniques
9. Holistic Robustness
10. Future Research Directions
11. Additional Reading
Acknowledgements
References

Robust Query Processing: A Survey

The primary function of a database system is to efficiently compute correct answers to user queries. Therefore, robust query processing (RQP), where strong numerical guarantees are provided on query performance, has been a long-standing core objective in the design of industrial-strength database engines. Unfortunately, however, RQP has proved to be a largely intractable and elusive challenge, despite sustained efforts spanning several decades. In this monograph, a holistic coverage of the RQP innovations is provided, and strengths and limitations are highlighted. Further, open technical problems that remain to be solved to make RQP a contemporary reality are also enumerated.

In this monograph, representative techniques along these various dimensions are covered. After the introduction, a background to RQP is given. In the chapters thereafter, the authors cover Robust Operators, Plans, and Execution, and then Structural Bounds, Cost Models and Machine Learning Techniques are surveyed. The monograph concludes with a chapter on Holistic Robustness, and Future Directions.

The target audience for this monograph includes researchers, developers and students with an interest in the internals of database engines.

 
DBS-089