Jul 2025
10 min

A practical guide to building real-time project tracking dashboards using Streamlit, Plotly, and Pandas. Covers data ingestion from Google Sheets, multi-tab layouts, interactive filtering, KPI cards, progress charts, styled DataFrames, and deployment patterns.
Jun 2025
12 min

A step-by-step guide to programmatically managing GitHub Projects V2 using the GraphQL API. Covers querying fields, updating single-select and text values, adding issues, posting automated comments, running on GitHub Actions, and building idempotent pipelines. All code is production-ready Python you can copy and adapt.
May 2025
8 min

Documents the model development process and the evaluation approaches used, both of which may be reusable in other studies.
Mar 2025
5 min

An exploration of how parental income, race, and neighborhood characteristics shape children's economic outcomes in adulthood, using the Opportunity Atlas data.
Mar 2025
8 min

A replication and extension of McCabe et al. (2024, Nature), examining how Twitter's post-January 6th deplatforming of 70,000 accounts affected the spread of misinformation. Applies Difference-in-Differences (DID) and Sharp Regression Discontinuity (SRD) to panel data from 500,000+ users to assess causal impacts on misinformation reach.
Mar 2024
8 min

GDP is a key measure of economic performance, but its quarterly release cycle limits timely tracking. This paper explores the link between mid-to-high frequency indicators and GDP growth, using Shenzhen as a case study. PMI, CCI, and industrial value-added were selected based on data availability and timeliness. Using national-level data to infer local growth proved feasible, especially post-2005. Directional prediction showed moderate results (PMI around 60%), though numerical prediction from limited indicators fell short of expectations due to the complexity of the economic system. Further work on data processing and model refinement is needed.
Sep 2023
10 min

Public data has become a critical production factor in China's digital economy, and cities across the country are now pilot-testing ways to open it up responsibly. This piece examines the authorization and operation models that Zhejiang, Qingdao, Beijing, and Changsha have adopted, each with distinct approaches: from tightly scoped, single-use permits to broader platform-based schemes where operators can build and sell data products. It unpacks the key steps in the process (who authorizes what, how operators get selected, what safeguards exist) and maps out the core players: government agencies, licensed operators, data source departments, and downstream users. The goal is to present a practical overview of where public data authorization stands today and what the main design tradeoffs look like across these models.
May 2023
10 min

This study applies network analysis to examine the relationship between community mobility and crime rates, using large-scale data from the New York City TLC Trip Record. Both linear regression and an Exponential Random Graph Model (ERGM) are used to study how mobility and crime interrelate, revealing crime patterns that diffuse beyond adjacent neighborhoods.
Dec 2022
8 min

A comparative analysis of how the US and UK regulate biometric data collection and use, examining differences in legislative frameworks, enforcement mechanisms, and privacy protections across the two jurisdictions.
Nov 2022
7 min

When the Fed speaks, markets listen. But it turns out that what moves Treasury yields isn't whether the tone is positive or negative; it's how uncertain the language sounds. This text analysis of FOMC statements and minutes over 16 years reveals that the uncertainty embedded in Fed communications has a statistically significant link to bond yields across maturities, while overall sentiment scores tell a different story: they predict volatility, not direction.