Welcome to my personal website!
I am a Software Engineer at Databricks Inc., where I work on the Apache Spark team to advance one of the fastest and most scalable data engines in the world. I also serve as an Adjunct Assistant Professor at UMass Amherst.
My research focuses on big data management, data-processing systems, and machine learning systems. I take a systems-driven approach by co-designing the key components of modern data-intensive pipelines, including workflow engines, UDF debugging frameworks, pipelining optimizers, and machine learning acceleration systems for streaming data. To optimize performance, usability, and scalability, I integrate techniques across data management, distributed systems, program analysis, and machine learning.
I have contributed extensively to the Apache Texera (Incubating) project, a collaborative and interactive system for data science and AI/ML using workflows. My research has been published in database venues such as SIGMOD, VLDB and ICDE, and my interdisciplinary work spans venues including TOCHI, PNAS Nexus, JAMIA, AMIA, and PLOS ONE.
Education
-
Ph.D. in Computer Science
Sep 2019 - Aug 2025
-
B.S. in Computer Science
Sep 2015 - Jun 2019
Work
-
Software Engineer
Aug 2025 - Present · 7 mos
Working on PySpark.
-
Adjunct Assistant Professor
May 2025 - Present · 10 mos
Serving in an adjunct capacity.
-
Software Engineer Intern
Jun 2024 - Sep 2024 · 4 mos
Dataset transformer, Log analytics, Streaming windows, Snowflake Time Travel
-
Research Intern
Jun 2022 - Sep 2022 · 3 mos
Real-time window aggregation, Out-of-order events, In-memory data structures, Flink optimization
-
Research Intern
Jun 2020 - Sep 2020 · 3 mos
HTAP database, Real-time query processing, MySQL-to-Kudu schema conversion, Lock-free heap structure
Awards
2026
-
UCI CS Best Dissertation Honorable Mention Award
University of California, Irvine
2025
-
Most Promising Future Faculty Award
University of California, Irvine
Received at the 33rd UCI Teach Day Celebration of Teaching.
-
Beall Family Foundation Graduate Student Entrepreneur Award in Computer Science
University of California, Irvine
-
Joseph & Dorothy Fischer Memorial Endowed Fellowship
University of California, Irvine
2024
-
Graduate Dean’s Dissertation Fellowship
University of California, Irvine
Received a prestigious dissertation fellowship recognizing a highly impactful thesis.
-
SIGMOD 2024 Best Demo Runner-Up Award
SIGMOD 2024
-
Student Travel Award (SIGMOD)
SIGMOD 2024
2023
-
Public Impact Fellowship
University of California, Irvine
-
Student Travel Award (VLDB)
VLDB 2023
2020
-
Best Lecturer Award
CUCS
Recognized for excellence in teaching performance.
Selected Publications
2025
- DS4ALL: Teaching High-School Students Data Science and AI/ML Using the Texera Workflow Platform as a Service. In Data Science Education K-12: Research to Practice Annual Conference (DSE-K12), Feb 2025
2024
- Brain image data processing using collaborative data workflows on Texera. Frontiers in Neural Circuits (fncir), Feb 2024