From the Lab Bench to the Living Room: A History of Product Testing

Manan Beg24 May 20264 min read

In this piece

What the Military Needed and Why It Mattered
Peryam, Pilgrim, and the Scale That Stuck
Rose Marie Pangborn and the UC Davis Contribution
From the Lab Bench to the Living Room in Practice
Frequently Asked Questions
What's the difference between hedonic scales and traditional preference measurement approaches?
Which sensory science principles from the 1940s, 1960s still guide consumer testing today?

The lab bench living room is not a metaphor. It describes a real migration: the controlled sensory experiments of the 1940s military research labs moving into kitchens, pantries, and dining tables to study how ordinary people experience products. That migration built the foundations of modern consumer product testing, and the methods it produced still run underneath concept tests, pack tests, and flavor evaluations today.

Key Takeaways

The US military's ration research program in the 1940s created the first systematic framework for measuring food acceptance at scale.

Peryam and Pilgrim published the 9-point hedonic scale in 1957, giving researchers a standardized tool that remains the dominant preference measurement instrument in consumer testing.

Rose Marie Pangborn at UC Davis elevated sensory science from military logistics into a rigorous academic discipline with direct commercial application.

The core insight from this era (that liking must be measured separately from sensory attribute detection) remains the organizing principle of product evaluation today.

What the Military Needed and Why It Mattered

In 1943, the US Army Quartermaster Corps had a concrete problem: soldiers were leaving food in the field. Not because they were full. Because the rations tasted bad. The Corps contracted researchers to find out what "acceptable" meant when 18-year-olds from Brooklyn and rural Georgia had to eat the same product in a foxhole.

That mandate produced something the food industry had never built: a systematic method for measuring hedonic response, not whether a person could detect a flavor difference, but whether they liked what they tasted. The distinction sounds obvious now. In 1943, most food evaluation was either expert sensory panels rating attribute intensity or informal feedback. The military program forced researchers to separate the two questions, and that separation is still the organizing principle of every product test worth running.

Peryam, Pilgrim, and the Scale That Stuck

David Peryam and Frank Pilgrim, working at the Quartermaster Food and Container Institute in Chicago, published their formalization of the 9-point hedonic scale in 1957. The scale ran from "dislike extremely" to "like extremely," with a neutral midpoint. What made it durable was the validation work behind it: Peryam and Pilgrim tested the scale's reliability across different populations, products, and conditions before publishing.

The scale remains the most widely used single instrument in consumer product testing more than sixty years after publication, a longevity documented in the Journal of Sensory Studies. The 9-point structure survived because it balanced discrimination (enough points to detect real preference differences) with cognitive manageability, respondents don't reliably distinguish more than nine steps on a liking dimension. Every category team running a sequential monadic test today is working with architecture Peryam and Pilgrim built from military commissary data.

From the practitioners

Run your next study on Enumerate.

See how Enumerate works on a study like yours. Book a 30-minute demo and we'll walk you through it.

Book a demo

Tailored to your use case

Rose Marie Pangborn and the UC Davis Contribution

Rose Marie Pangborn joined UC Davis in the late 1950s and spent three decades making sensory science into a proper academic discipline. Where the Quartermaster research was applied and logistical, Pangborn's work was systematic and theoretical: she studied the psychophysics of taste, the interaction between sensory modalities, and the cognitive processes behind food preference. Her 1965 monograph on flavor research became a foundational text for the field. Pangborn established that consumer liking is not a simple readout of sensory input.

Context, expectation, memory, and cross-modal effects all shape what a person reports. A product that scores well in a controlled lab setting can fail in home use because the lab stripped out the environmental signals that normally frame the experience. This is why diary studies (products evaluated in real use contexts over time) eventually became essential alongside the bench test. Pangborn laid the intellectual groundwork that made that evolution necessary.

From the Lab Bench to the Living Room in Practice

The migration from lab to home use happened incrementally as researchers noticed the gap between controlled-environment scores and in-market performance. A snack food rating 7.2 on the hedonic scale in a sensory booth would consistently underperform in household trials. The lab controlled for confounds that turned out to be meaningful: lighting, ambient smell, the presence of other foods. Central location testing (CLT) became the compromise, controlled enough for comparable data, naturalistic enough to avoid the worst distortions.

In-home use tests (IHUTs) followed for categories where habitual use was the relevant context. Video diary research and AI-moderated longitudinal studies extend that logic further, capturing hedonic ratings alongside the texture of real usage over days and weeks, something Enumerate's AI moderator handles at a scale no single-session CLT was built to reach. The principles Peryam, Pilgrim, and Pangborn established (separate detection from liking, control context deliberately, validate your scale) still hold. The setting has just moved from the Chicago commissary to the respondent's kitchen.

Want to see how Enumerate runs in-context product evaluations at speed? Book a demo with Enumerate.

Frequently Asked Questions

What's the difference between hedonic scales and traditional preference measurement approaches?

Traditional preference measurement asked respondents to choose between products, A versus B, forced choice. Hedonic scales measure absolute liking on a graded continuum, so a researcher can assess whether either product clears an acceptable threshold, not just which one wins the head-to-head. The 9-point scale Peryam and Pilgrim published in 1957 made that absolute measurement standardized and comparable across studies.

Which sensory science principles from the 1940s, 1960s still guide consumer testing today?

Three hold: separate hedonic response (liking) from sensory attribute detection, validate your measurement instrument before trusting the data, and account for context effects when designing the test environment. The methods built around these principles (CLTs, IHUTs, sequential monadic designs) all derive from decisions made in the Quartermaster Labs and at UC Davis between 1943 and the mid-1960s.

From the Lab Bench to the Living Room: A History of Product Testing

What the Military Needed and Why It Mattered

Peryam, Pilgrim, and the Scale That Stuck

Rose Marie Pangborn and the UC Davis Contribution

From the Lab Bench to the Living Room in Practice

Frequently Asked Questions

What's the difference between hedonic scales and traditional preference measurement approaches?

Which sensory science principles from the 1940s, 1960s still guide consumer testing today?

Related Reading

The New Shopper Geography: Quick Commerce, Live Commerce, Social Commerce

Procter & Gamble's First Moment of Truth: The Three-to-Seven-Second Window That Reorganized a $70B Company

A Short History of Watching People Shop

From the Lab Bench to the Living Room: A History of Product Testing

What the Military Needed and Why It Mattered

Peryam, Pilgrim, and the Scale That Stuck

Rose Marie Pangborn and the UC Davis Contribution

From the Lab Bench to the Living Room in Practice

Frequently Asked Questions

What's the difference between hedonic scales and traditional preference measurement approaches?

Which sensory science principles from the 1940s, 1960s still guide consumer testing today?

Related Reading

The New Shopper Geography: Quick Commerce, Live Commerce, Social Commerce

Procter & Gamble's First Moment of Truth: The Three-to-Seven-Second Window That Reorganized a $70B Company

A Short History of Watching People Shop