Archival Extraction & Recipe Corpus Formation
The archival anchor for this project was Shiu Wong Chan’s The Chinese Cook Book (1917), a foundational text in the history of Chinese cuisine in the Western world. Published in New York, Chan’s work represents one of the earliest comprehensive attempts to systematize and translate Chinese culinary practice for an American audience. As a historical touchstone, it is positioned at the intersection of two cultures and provides a baseline against which to measure subsequent evolutions in the representation of Chinese cooking. I first completed a full survey of the text and generated a dataset using Google Sheets of all recipes that contained complete ingredient lists. This resulted in a working corpus of 143 recipes, excluding preliminary instructional entries, such as how to make peanut oil or cook rice, which contained rudimentary or incomparable structures for analysis.
For each recipe, I manually extracted metadata, including:
- Recipe ID
- Dish Name (in English, Chinese, and romanized)
- Dish Genre (soup, noodle, chicken, etc.)
- Author
- Ingredients
- Page Number
This step allowed the cookbook to function as both a culinary archive and cultural document, reflecting early 20th-century negotiations of authenticity, legibility, and assimilation in Chinese-American food.

