I'm a Ph.D. student at the University of Washington, advised by Professor Dongfang Zhao. My research focuses on the intersection of database systems and data retrieval. I develop efficient indexing techniques to optimize data retrieval strategies in large-scale vector databases.
Research Areas: Database Systems, Large-Scale Data Management, Information Retrieval, Indexing Techniques
I am currently researching advanced indexing and search techniques.
Husky 100 E-Portfolio Reflection
My University of Washington experience has transformed how I learn, lead, and persist through uncertainty. Academically, my Ph.D. work in database systems and information retrieval has taught me to design scalable indexing and retrieval methods while balancing theory, experimentation, and real-world constraints. Beyond coursework and papers, I applied research skills such as critical thinking, iteration, and clear communication to projects and collaborations that required building something useful, learning quickly, and adapting to real constraints.
Teaching and mentoring helped me communicate complex ideas clearly and support students with diverse goals. Conference and journal reviewing deepened my understanding of research quality and community responsibility. These roles complemented my research by strengthening my empathy, clarity, collaboration, and focus on impact.
Looking forward, I plan to build trustworthy, efficient retrieval systems and continue mentoring others. I also aim to support women in computer science and AI. UW has prepared me to tackle difficult problems with persistence, accountability, and a commitment to serving the broader community.
NexusIndex is a self-optimizing multimodal framework that integrates multi-model embeddings, attention mechanisms, and a FAISSNexusIndex layer. It enables efficient fake news detection by combining dynamic vector indexing and real-time retrieval across large-scale textual and visual datasets.
VecLSTM is a hybrid framework that enhances trajectory prediction by integrating dynamic vectorization techniques with a CNN–LSTM architecture. It efficiently processes trajectory data within a relational database environment, improving both accuracy and training time compared to traditional LSTM models.
Efficient Feature Extraction for Image Analysis through Adaptive Caching in Vector Databases This research introduces a caching subsystem leveraging in-memory vector databases to enhance the efficiency of image feature extraction. Using advanced models like MobileNetV3 and ResNet50, the framework integrates batch insertion and parallel processing to optimize computational performance.
This research focused on using deep neural networks and hierarchical clustering to predict chaotic transitions in natural convection systems, specifically in the Lorenz system.
Program Committee: The Web Conference (WWW) 2026 — Security & Privacy Track
Reviewer: The Web Conference (WWW) 2025 — Search & Retrieval-Augmented AI, IEEE eScience 2025 , IPDPS 2025 , ICDCS 2025 , ACM CIKM 2024
Journal Reviewer: Journal of Big Data (2024–2025)
Lecturer — North Seattle College
Department of Math & Science
September 2023 – Present, Seattle, WA
• CSC 110: Python Programming
• CSC 142: JAVA Computer Programming I (LEC)
Instructor Page
Lecturer — Central Washington University
Department of Computer Science
September 2022 – 2023, Ellensburg & Des Moines, WA
• CS380: Software Engineering (Java and Project Management)
• CS470: Operating Systems (C, C++, and Linux)
I received my second Master’s degree in Information Technology Infrastructure from Illinois Institute of Technology, Chicago, USA, with a 4.0 GPA. I also received a Master’s degree in Data Science, where my thesis focused on providing a model for optimizing customer value at the contact center. Additionally, I completed a Bachelor's degree in Computer Software Engineering.