Research

Harvard Aging Brain Study (Summer 2024)

Served as a research intern on the Data Analytics team for the Harvard Aging Brain Study . Contributed to the initiation and development of a machine learning project focused on cross-modality MRI synthesis (T1-T2), while supporting the team’s broader data processing and statistical analyses.

My Contributions


Perception Control & Cognition Lab (May 2023-May 2024)

Conducted research in computational social science investigating whether large language models (LLMs) can simulate human behavior when conditioned on demographic attributes such as age, race, education, religion, ideology, and income, building upon prior lab work published in Out of One, Many: Using Language Models to Simulate Human Samples (2022).

My Contributions

Primary Study: Mimicking Conversational Dynamics

Results

Bar Chart of donation frequency, split by demographics
Figure 1: Donation frequency by demographics. Revised prompting closely matches the real human distribution, while initial prompting consistently overestimates donation likelihood.

Key Findings

Additional Research


Interdisciplinary Computational Health Sciences (Nov 2023 – Apr 2024)

Collaborated in an interdisciplinary research team of Computer Science, Statistics, and Public Health faculty and students to investigate large-scale public health patterns using machine learning and statistical modeling.

My Contributions

Primary Study: Predicting Maternal Health Complications

Results

Model Performance Comparison Bar Chart (Precision, Recall, and Accuracy)
Figure 1: Comparison of model performance (accuracy, precision, recall) for predicting maternal health complications. Models showed similar accuracy, high recall, and low precision, reflecting class imbalance and the emphasis on minimizing false negatives.
Confusion Matrix of Logistic Regression model
Figure 2: Confusion matrix for the Logistic Regression model predicting maternal health complications. The model achieved high recall, identifying most complication cases, with class imbalance reflected in the large number of negative cases.

Key Findings

Additional Research


Capstone-level R&D project (Sept 2023-Apr 2024)

Developed a web application for scripture study featuring cross-lingual semantic search using OpenAI text embeddings. Embedded multilingual queries and aggregated cosine similarity scores to improve passage retrieval and recommendations.

My Contributions

Key Findings

Scatter plot of english vs mandarin cosine similarities, two passages shown that are high in both
Figure 1: Cosine similarity of Scripture passages to a compassion-related query in English (x-axis) and Mandarin (y-axis). Each point represents a passage; highlighted results show top matches with high cross-lingual semantic similarity.