Big Data

A Tool for Estimating and Visualizing Poverty Maps

"Poverty maps" are designed to display the spatial distribution of welfare together with different dimensions of poverty determinants. Plotting such information on maps relies heavily on data collected through national household surveys and censuses. However, because this type of data collection is costly and therefore infrequent, poverty maps often fail to capture the current deprivation status.

MapReduce Framework for Swarm Robot Systems

This was part of the research projects conducted under the Basic Research Laboratory Grant from the Ministry of Education, Science, and Technology in South Korea. The project is twofold: (1) simulating an application of swarm robot systems; (2) designing a software framework for swarm robot systems that reduces the complexity of developing applications while minimizing the amount of transmitted data by adopting the MapReduce paradigm. The video above is a simulation of a swarm robot system application that searches for red pillars (foraging).
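As a rough illustration of how the MapReduce paradigm can cut down transmitted data in a foraging task, the sketch below has each robot map its local sightings to key-value pairs and combine them locally, so that only a small summary is sent for the final reduce. The function names, the grid-cell keys, and the reading format are all hypothetical; this is not the project's actual framework API.

```python
from collections import Counter

def map_readings(readings):
    """Map step (runs on each robot): emit (cell, 1) for every red sighting.

    `readings` is an assumed format: a list of (grid_cell, color) tuples.
    """
    for cell, color in readings:
        if color == "red":
            yield cell, 1

def combine(pairs):
    """Local combine step: sum counts per cell before anything is transmitted."""
    counts = Counter()
    for key, value in pairs:
        counts[key] += value
    return counts

def reduce_all(partials):
    """Reduce step (runs on the collector): merge the per-robot summaries."""
    total = Counter()
    for partial in partials:
        total.update(partial)  # Counter.update adds counts together
    return total

# Two hypothetical robots with raw local readings.
robot_a = [((0, 0), "red"), ((0, 0), "red"), ((1, 2), "blue")]
robot_b = [((0, 0), "red"), ((3, 1), "red")]

# Each robot transmits only its combined summary, not every raw reading.
partials = [combine(map_readings(r)) for r in (robot_a, robot_b)]
total = reduce_all(partials)
```

Here robot_a sends one entry instead of three raw readings, which is the kind of saving the local combine step is meant to show.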

Large-scale News Image Analysis with MapReduce-based LSH and VisualRank

Hao Li (a Ph.D. student in CS) and I conducted a big-data analysis project using the MapReduce framework (Hadoop) as the final project for INFM718G (Data-Intensive Computing with MapReduce, taught by Dr. Jimmy Lin). Targeting all the news images from April 2013, we tried to rank the images by their importance and popularity. To do that, we extracted image features using SIFT (Scale-Invariant Feature Transform) and constructed a graph of images using LSH (Locality-Sensitive Hashing) as a means to approximate the similarity of images.
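The idea of using LSH to approximate similarity when building the image graph can be sketched as follows. This is a minimal single-machine illustration, not the project's Hadoop implementation: it assumes one fixed-length feature vector per image and uses random-hyperplane hashing (which approximates cosine similarity) as a stand-in, since the source does not say which LSH family was used. All function names and the toy vectors are hypothetical.

```python
import random
from collections import defaultdict
from itertools import combinations

def make_hyperplanes(num_bits, dim, seed=0):
    """Draw `num_bits` random hyperplanes (Gaussian normal vectors)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]

def signature(vec, hyperplanes):
    """One bit per hyperplane: which side of the plane the vector falls on."""
    return tuple(
        1 if sum(h_i * v_i for h_i, v_i in zip(h, vec)) >= 0 else 0
        for h in hyperplanes
    )

def build_graph(features, num_bits=8, seed=0):
    """Connect images whose feature vectors hash to the same bucket.

    `features` maps an image id to its feature vector (assumed format).
    Returns a set of undirected edges as sorted (id, id) tuples.
    """
    dim = len(next(iter(features.values())))
    planes = make_hyperplanes(num_bits, dim, seed)
    buckets = defaultdict(list)
    for image_id, vec in features.items():
        buckets[signature(vec, planes)].append(image_id)
    edges = set()
    for ids in buckets.values():
        # Only images sharing a bucket are compared, avoiding all-pairs work.
        edges.update(combinations(sorted(ids), 2))
    return edges

# Toy vectors: "a" and "b" point in the same direction, "c" in the opposite one.
features = {"a": [1.0, 2.0, 3.0], "b": [2.0, 4.0, 6.0], "c": [-1.0, -2.0, -3.0]}
edges = build_graph(features)
```

In this toy input, "a" and "b" always share a bucket because they differ only by a positive scale factor, while "c" lands on the opposite side of every hyperplane. A graph built this way could then be fed to a PageRank-style ranking such as VisualRank.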