Data Management

Toward Identifying Values and Tensions in Designing a Historically-Sensitive Data Platform: A Case of the Urban Renewal Project

Urban renewal was a national initiative from 1950s through 70s aimed at improving so-called “blighted” areas, and resulted in the displacement of many vibrant communities. While the underlying mechanisms of urban renewal have been examined, there have been very few data-driven, evidence-based studies that take into account the histories and interests of former residents. The “Human Face of Big Data” project started as a digital curation effort to design and develop a web-based, big data platform that provides insights and analytics into the mechanisms of this process.

A Tool for Estimating and Visualizing Poverty Maps

"Poverty maps" are designed to simultaneously display the spatial distribution of welfare and different dimensions of poverty determinants. The plotting of such information on maps heavily relies on data that is collected through infrequent national household surveys and censuses. However, due to the high cost associated with this type of data collection process, poverty maps are often inaccurate in capturing the current deprivation status.

Human Face of Big Data

This project aims to build a map-based platform that can be used in presenting historical documents of the nation-wide, urban renewal project in 1960's and 70's to provide easy-to-use interfaces that can be used by former residents, archivists, reserachrs, and citizens; and ultimately to reconstruct a virtual neighborhood where people can share their memories.

Mapping Inequality

Over the past 4 years, teams of archivists have digitized portions of National Archives RG195: General Records of the Home Owners’ Loan Corporation [HOLC]. The records include surveys, memorandums, and maps of American neighborhoods in the 1930’s. Now that the records are available digitally, students and faculty members are working to curate the collection and “data-fy” the information contained within. This involves extracting text from digitized 75 year old records, designing queryable databases to house the information, and disseminating the information so it is publically accessible.

Toward an Ecology Theory of Creativity in IT Products: A Study of Mobile Device Industry

In a creative process, divergent thinking needs to be stimulated to generate novel ideas; yet these ideas must be synthesized to produce something valuable. Hence to foster creativity in developing IT products, creators need to manage the tension between novelty and value. Since the forces affecting the novelty-value tension often exist outside a creator's group or organization, we apply organizational ecology theory to propose an industry-level, ecological model for understanding the novelty of IT products.

Image Retrieval Systems based on Color Similarity and Edge Detection

This project was conducted in 2008 for my bachelor's thesis in the department of Electrical Engineering at Seoul National University (it was more like a capston project rather than a thesis, since the focus of the project was mainly at implementing algorithms rather than analyzing the performance of algorithms using concrete measures, e.g., recall and precision). I implemented an image retrieval system prototype that takes an image as input, and outputs most similar images from the image database.

Open Data Impact Map

The Center for Open Data Enterprise is a non-profit organization that aims to maximize the value of open data as a public resource that anyone can use. As a means to promote the impact and value of using open data, the center designed and developed the Open Data Impact map. As a Data Science & Technology Fellow at the Center for Open Data Enterprise, I have worked on the Open Data Impact Map, which is a searchable, centralized database of open data use cases from around the world. The map shows the distribution of organizations in the world that make use of open data.

On Local Information Landscapes and Information Deserts

Information accessibility problems include diverse types of human- and system-driven barriers that make it difficult for individuals to access desired information. These issues have been studied in two main streams: (1) a human-centered view based on the understand­ing of individual-level characteristics such as physical impairment and economic status; and (2) a technology-focused view that emphasizes on system factors such as the information filtering techniques and interface designs.

Wireless Price Display System

This project was my final team project for "Electrical Engineering Laboratory 3" class in 2004 (when I was junior in college). The goal was to make a wireless system that substitutes price tags with electrical displays in grocery stores. The base station manages a list of products in the database, and it sends out price information to receiver units. Receiver units are simple character displays having unique IDs. When any products' prices change, the system can easily update new prices on these displays. 

Drupal Course Management Module for UMD Websites

This project was designed to provide most up-to-date courses information to University of Maryland College of Arts and Humanities (ARHU) websites. Since most ARHU websites were implemented with Drupal (Content Management System), a Drupal module was developed to migrate the university's course catalogs into Drupal databases. For crawling the course catalogs, Brady Law (CS student)'s Python scripts were modified and re-implemented.