PhD Preliminary Oral Exam – Vahid Ghadakchi

Developing Effective Query Interfaces Over Structured and Semi-Structured Data

Many database users are not familiar with formal query languages, the concept of a schema, or the exact content of their database. Thus, it is challenging for these users to express their information needs over semi-structured and structured databases. To address this problem, researchers have proposed usable query interfaces through which users can express their information needs without knowing formal query languages, the schema, or the exact content of the database. Although these interfaces make databases more usable, they inherently suffer from low search effectiveness. The recent growth in databases' content size and schema complexity only exacerbates this problem. In this proposal, we present theoretical and empirical results on the impact of database size on search effectiveness and describe an approach that uses only a relatively small subset of the database to answer most queries effectively. Since this subset may not contain the relevant answers to many queries, we also propose a method that predicts whether a query can be answered more effectively using this subset or the entire database. Our comprehensive empirical studies using multiple real-world databases and query workloads indicate that our approach significantly improves both the effectiveness and efficiency of answering queries. Furthermore, we provide theoretical and empirical results on database transformations that increase or decrease search effectiveness. Using these results, one can compare different schemas in terms of their search effectiveness.

Major Advisor: Arash Termehchy
Committee: Alan Fern
Committee: Liang Huang
Committee: Prasad Tadepalli
GCR: Yelda Turkan

Tuesday, June 11, 2019 at 2:00pm to 4:00pm

Kelley Engineering Center, 1007
110 SW Park Terrace, Corvallis, OR 97331

Electrical Engineering and Computer Science
Calvin Hughes

