Image Processing

An Evaluation of GPT-4V for Transcribing the Urban Renewal Hand-Written Collection

In November 2023, OpenAI released GPT-4V(ision), which includes Optical Character Recognition (OCR) capabilities. Because much of the data curation, processing, and cleaning can be managed through user-friendly prompts (i.e., chat), we conduct an initial assessment of GPT-4V's effectiveness in transcribing hand-written documents from the urban renewal collection. If GPT-4V can accurately digitize hand-written documents through carefully crafted prompts, it could become a valuable tool for non-experts who need to transcribe historical documents at scale.

Fundamental Technologies for the Multi-Scale Mass-Deployable Cooperative Robots

"Multi-scale mass-deployable cooperative robots are a next-generation robotics paradigm where a large number of robots that vary in size cooperate in a hierarchical fashion to collect information in various environments. While this paradigm can exhibit the effective solution for exploration of the wide area consisting of various types of terrain, its technical maturity is still in its infant stage and many technical hurdles should be resolved to realize this paradigm."  (Choo et al., 2013)

MapReduce Framework for Swarm Robot Systems

This was part of a set of research projects conducted under the Basic Research Laboratory Grant from the Ministry of Education, Science, and Technology in South Korea. The project was twofold: (1) simulating an application of swarm robot systems; and (2) designing a software framework for swarm robot systems that reduces the complexity of developing applications while minimizing the amount of transmitted data by adopting the MapReduce paradigm. The video above shows a simulation of a swarm robot application that searches for red pillars (foraging).
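As a rough illustration of how a MapReduce-style split can cut the amount of transmitted data, the sketch below has each robot map its own observations to key-value pairs and pre-aggregate them locally before sending anything to a collector. This is not the project's actual framework: the observation format `(x, y, label)`, the `"red_pillar"` label, and the function names `map_phase`, `combine`, and `reduce_phase` are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical observation record: (grid_x, grid_y, label).


def map_phase(observations):
    """Runs on each robot: emit ((x, y), 1) for every red-pillar sighting."""
    for x, y, label in observations:
        if label == "red_pillar":
            yield (x, y), 1


def combine(pairs):
    """Runs on each robot before transmission.

    Pre-aggregating locally means a robot sends one count per grid cell
    instead of one message per raw observation.
    """
    local = Counter()
    for key, value in pairs:
        local[key] += value
    return dict(local)


def reduce_phase(partials):
    """Runs on the collector: merge the per-robot partial counts."""
    total = Counter()
    for partial in partials:
        total.update(partial)  # Counter.update adds counts key by key
    return dict(total)


# Two simulated robots with overlapping sightings.
robot_a = [(1, 2, "red_pillar"), (1, 2, "red_pillar"), (3, 4, "wall")]
robot_b = [(1, 2, "red_pillar"), (5, 5, "red_pillar")]

partials = [combine(map_phase(obs)) for obs in (robot_a, robot_b)]
sightings = reduce_phase(partials)
print(sightings)  # {(1, 2): 3, (5, 5): 1}
```

In this toy run, robot A transmits a single aggregated entry rather than three raw observations, which is the kind of saving the combiner step is meant to provide.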

Image Retrieval Systems based on Color Similarity and Edge Detection

This project was conducted in 2008 for my bachelor's thesis in the Department of Electrical Engineering at Seoul National University. It was closer to a capstone project than a thesis, since the focus was on implementing algorithms rather than on analyzing their performance with concrete measures such as recall and precision. I implemented an image retrieval system prototype that takes an image as input and outputs the most similar images from the image database.
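A minimal sketch of color-based retrieval is given below. It is not the thesis implementation: it assumes images are already decoded into lists of `(r, g, b)` pixels, uses a coarse quantized color histogram with histogram intersection as the similarity score, and leaves out edge detection entirely. The names `color_histogram`, `histogram_intersection`, and `retrieve` are hypothetical.

```python
def color_histogram(pixels, bins=4):
    """Normalized RGB histogram with `bins` levels per channel."""
    hist = [0.0] * (bins ** 3)
    for r, g, b in pixels:
        # Map each 0..255 channel value to a bin index in 0..bins-1.
        ri, gi, bi = r * bins // 256, g * bins // 256, b * bins // 256
        hist[ri * bins * bins + gi * bins + bi] += 1.0
    n = len(pixels)
    return [h / n for h in hist] if n else hist


def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1.0 means identical normalized histograms."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def retrieve(query_pixels, database, top_k=3):
    """Return the top_k (name, score) pairs, most similar first."""
    q = color_histogram(query_pixels)
    scored = [
        (name, histogram_intersection(q, color_histogram(pixels)))
        for name, pixels in database.items()
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy database of tiny synthetic "images".
RED, BLUE = (250, 10, 10), (10, 10, 250)
database = {
    "blue": [BLUE] * 4,
    "mixed": [RED, RED, BLUE, BLUE],
    "red": [RED] * 4,
}

results = retrieve([RED] * 4, database)
print([name for name, _ in results])  # ['red', 'mixed', 'blue']
```

A histogram comparison like this ignores where colors appear in the image, which is one reason a system might pair it with an edge-based feature.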