Food, wine and lists.

Tue Nov 17

A Hint of Hype, A Taste of Illusion

The Wall Street Journal, November 14, 2009

They pour, sip and, with passion and snobbery, glorify or doom wines. But studies say the wine-rating system is badly flawed. How the experts fare against a coin toss.

By LEONARD MLODINOW

Acting on an informant’s tip, in June 1973, French tax inspectors barged into the offices of the 155-year-old Cruse et Fils Frères wine shippers. Eighteen men were eventually prosecuted by the French government, accused, among other things, of passing off humble wines from the Languedoc region as the noble and five-times-as-costly wine of Bordeaux. During the trial it came out that the Bordeaux wine merchants regularly defrauded foreigners. One vat of wine considered extremely inferior, for example, was labeled “Salable as Beaujolais to Americans.”

It was in this climate that in the 1970s a lawyer-turned-wine-critic named Robert M. Parker Jr. decided to aid consumers by assigning wines a grade on a 100-point scale. Today, critics like Mr. Parker exert enormous influence. The medals won at the 29 major U.S. wine competitions medals are considered so influential that wineries spend well over $1 million each year in entry fees. According to a 2001 study of Bordeaux wines, a one-point bump in Robert Parker’s wine ratings averages equates to a 7% increase in price, and the price difference can be much greater at the high end.

Given the high price of wine and the enormous number of choices, a system in which industry experts comb through the forest of wines, judge them, and offer consumers the meaningful shortcut of medals and ratings makes sense.

But what if the successive judgments of the same wine, by the same wine expert, vary so widely that the ratings and medals on which wines base their reputations are merely a powerful illusion? That is the conclusion reached in two recent papers in the Journal of Wine Economics.

Both articles were authored by the same man, a unique blend of winemaker, scientist and statistician. The unlikely revolutionary is a soft-spoken fellow named Robert Hodgson, a retired professor who taught statistics at Humboldt State University. Since 1976, Mr. Hodgson has also been the proprietor of Fieldbrook Winery, a small operation that puts out about 10 wines each year, selling 1,500 cases

A few years ago, Mr. Hodgson began wondering how wines, such as his own, can win a gold medal at one competition, and “end up in the pooper” at others. He decided to take a course in wine judging, and met G.M “Pooch” Pucilowski, chief judge at the California State Fair wine competition, North America’s oldest and most prestigious. Mr. Hodgson joined the Wine Competition’s advisory board, and eventually “begged” to run a controlled scientific study of the tastings, conducted in the same manner as the real-world tastings. The board agreed, but expected the results to be kept confidential.

There is a rich history of scientific research questioning whether wine experts can really make the fine taste distinctions they claim. For example, a 1996 study in the Journal of Experimental Psychology showed that even flavor-trained professionals cannot reliably identify more than three or four components in a mixture, although wine critics regularly report tasting six or more. There are eight in this description, from The Wine News, as quoted on wine.com, of a Silverado Limited Reserve Cabernet Sauvignon 2005 that sells for more than $100 a bottle: “Dusty, chalky scents followed by mint, plum, tobacco and leather. Tasty cherry with smoky oak accents…” Another publication, The Wine Advocate, describes a wine as having “promising aromas of lavender, roasted herbs, blueberries, and black currants.” What is striking about this pair of descriptions is that, although they are very different, they are descriptions of the same Cabernet. One taster lists eight flavors and scents, the other four, and not one of them coincide.

That wine critiques are peppered with such inconsistencies is exactly what the laboratory experiments would lead you to expect. In fact, about 20 years ago, when a Harvard psychologist asked an ensemble of experts to rank five wines on each of 12 characteristics—such as tannins, sweetness, and fruitiness—the experts agreed at a level significantly better than chance on only three of the 12.

Psychologists have also been skeptical of wine judgments because context and expectation influence the perception of taste. In a 1963 study at the University of California at Davis, researchers secretly added color to a dry white wine to simulate a sauterne, sherry, rosé, Bordeaux and burgundy, and then asked experts to rate the sweetness of the various wines. Their sweetness judgments reflected the type of wine they thought they were drinking. In France, a decade ago a wine researcher named Fréderic Brochet served 57 French wine experts two identical midrange Bordeaux wines, one in an expensive Grand Cru bottle, the other accommodated in the bottle of a cheap table wine. The gurus showed a significant preference for the Grand Cru bottle, employing adjectives like “excellent” more often for the Grand Cru, and “unbalanced,” and “flat” more often for the table wine.

Provocative as they are, such studies have been easy for wine critics to dismiss. Some were small-scale and theoretical. Many were performed in artificial laboratory conditions, or failed to control important environmental factors. And none of the rigorous studies tested the actual wine experts whose judgments you see in magazines and marketing materials. But Mr. Hodgson’s research was different.

In his first study, each year, for four years, Mr. Hodgson served actual panels of California State Fair Wine Competition judges—some 70 judges each year—about 100 wines over a two-day period. He employed the same blind tasting process as the actual competition. In Mr. Hodgson’s study, however, every wine was presented to each judge three different times, each time drawn from the same bottle.

The results astonished Mr. Hodgson. The judges’ wine ratings typically varied by ±4 points on a standard ratings scale running from 80 to 100. A wine rated 91 on one tasting would often be rated an 87 or 95 on the next. Some of the judges did much worse, and only about one in 10 regularly rated the same wine within a range of ±2 points.

Mr. Hodgson also found that the judges whose ratings were most consistent in any given year landed in the middle of the pack in other years, suggesting that their consistent performance that year had simply been due to chance.

Mr. Hodgson said he wrote up his findings each year and asked the board for permission to publish the results; each year, they said no. Finally, the board relented—according to Mr. Hodgson, on a close vote—and the study appeared in January in the Journal of Wine Economics.

“I’m happy we did the study,” said Mr. Pucilowski, “though I’m not exactly happy with the results. We have the best judges, but maybe we humans are not as good as we say we are.”

This September, Mr. Hodgson dropped his other bombshell. This time, from a private newsletter called The California Grapevine, he obtained the complete records of wine competitions, listing not only which wines won medals, but which did not. Mr. Hodgson told me that when he started playing with the data he “noticed that the probability that a wine which won a gold medal in one competition would win nothing in others was high.” The medals seemed to be spread around at random, with each wine having about a 9% chance of winning a gold medal in any given competition.

To test that idea, Mr. Hodgson restricted his attention to wines entering a certain number of competitions, say five. Then he made a bar graph of the number of wines winning 0, 1, 2, etc. gold medals in those competitions. The graph was nearly identical to the one you’d get if you simply made five flips of a coin weighted to land on heads with a probability of 9%. The distribution of medals, he wrote, “mirrors what might be expected should a gold medal be awarded by chance alone.”

Mr. Hodgson’s work was publicly dismissed as an absurdity by one wine expert, and “hogwash” by another. But among wine makers, the reaction was different. “I’m not surprised,” said Bob Cabral, wine maker at critically acclaimed Williams-Selyem Winery in Sonoma County. In Mr. Cabral’s view, wine ratings are influenced by uncontrolled factors such as the time of day, the number of hours since the taster last ate and the other wines in the lineup. He also says critics taste too many wines in too short a time. As a result, he says, “I would expect a taster’s rating of the same wine to vary by at least three, four, five points from tasting to tasting.”

Francesco Grande, a vintner whose family started making wine in 1827 Italy, told me of a friend at a well-known Paso Robles winery who had conducted his own test, sending the same wine to a wine competition under three different labels. Two of the identical samples were rejected, he said, “one with the comment ‘undrinkable.’ ” The third bottle was awarded a double gold medal. “Email Robert Parker,” he suggested, “and ask him to submit to a controlled blind tasting.”

I did email Mr. Parker, and was amazed when he responded that he, too, did not find Mr. Hodgson’s results surprising. “I generally stay within a three-point deviation,” he wrote. And though he didn’t agree to Mr. Grande’s challenge, he sent me the results of a blind tasting in which he did participate.

The tasting was at Executive Wine Seminars in New York, and consisted of three flights of five wines each. The participants knew they were 2005 Bordeaux wines that Mr. Parker had previously rated for an issue of The Wine Advocate. Though they didn’t know which wine was which, they were provided with a list of the 15 wines, with Mr. Parker’s prior ratings, according to Executive Wine Seminars’ managing partner Howard Kaplan. The wines were chosen, Mr. Kaplan says, because they were 15 of Mr. Parker’s highest-rated from that vintage.

Mr. Parker pointed out that, except in three cases, his second rating for each wine fell “within a 2-3 point deviation” of his first. That’s less variation than Mr. Hodgson found. One possible reason: Mr. Parker’s first rating of all the wines fell between 95 and 100—not a large spread.

One critic who recognizes that variation is an issue is Joshua Greene, editor and publisher of Wine and Spirits, who told me, “It is absurd for people to expect consistency in a taster’s ratings. We’re not robots.” In the Cruse trial, the company appealed to the idea that even experienced tasters could err. Cruse claimed that it had bought the cheap Languedoc believing it was the kingly Bordeaux, and that the company’s highly-trained and well-paid wine tasters had failed to perceive that it wasn’t. The French rejected that possibility, and 35 years ago this December, eight wine dealers were convicted and given prison terms and fines totaling $8 million.

Despite his studies, Mr. Hodgson is betting that, like the French, American consumers won’t be easily converted to the idea that wine experts are fallible. His winery’s Web site still boasts of his own many dozens of medals.

“Even though ratings of individual wines are meaningless, people think they are useful,” Mr. Greene says. He adds, however, that one can look at the average ratings of a spectrum of wines from a certain producer, region or year to identify useful trends.

As a consumer, accepting that one taster’s tobacco and leather is another’s blueberries and currants, that a 91 and a 96 rating are interchangeable, or that a wine winning a gold medal in one competition is likely thrown in the pooper in others presents a challenge. If you ignore the web of medals and ratings, how do you decide where to spend your money?

One answer would be to do more experimenting, and to be more price-sensitive, refusing to pay for medals and ratings points. Another tack is to continue to rely on the medals and ratings, adopting an approach often attributed to physicist Neils Bohr, who was said to have had a horseshoe hanging over his office door for good luck. When asked how a physicist could believe in such things, he said, “I am told it works even if you don’t believe in it.” Or you could just shrug and embrace the attitude of Julia Child, who, when asked what was her favorite wine, replied “gin.”

As for me, I have always believed in the advice given by famed food critic Waverly Root, who recommended that one simply “Drink wine every day, at lunch and dinner, and the rest will take care of itself.”

—Leonard Mlodinow teaches randomness at Caltech. His most recent book is “The Drunkard’s Walk: How Randomness Rules Our Lives.”

Wed Aug 12

Last Supper

Mike - Junk Burger at Mountain Sun Brewery (Boulder) w/Jalapeno Mayo and Hand Cut Fries - Nitrogen-driven Colorado Kind Pale Ale (same as my birthday meal each year).

Travis -

Breakfast - Jailhouse Cafe, Moab - a side of Soul-Food bacon and a cup of Blue Mountain coffee and bloody mary

Lunch - Arthur Bryant’s, Kansas City - Brisket and a Budweiser

Lunch #2 - Angelo’s, Denver - A small pepperoni and black olive pizza with an EZ Street

Dinner - Thurston’s Lobster Pound, Bernard Maine - Lobster, Slaw, Roll, Blueberry Cake, Vinho Verde (whether it goes or not)

Desert - A glass of port, sex in a king-size bed, and an Alka Seltzer with a Tylenol PM chaser

Last Meal (a day-long event) -


Dish
A black olive pizza, homemade, with a great bottle of wine (whatever S tells me to drink). Also, while I’m making the pizza, I’m drinking Fat Tire (and a Levity, an EZ Street, an Avalanche) and snacking on a cheese and salami plate. Also, after I eat the pizza and I’m cleaning up, there is a bowl of ranier cherries and a couple glasses of port available. I wouldn’t turn down the sex and Alka Seltzer either.

Scott -

Meal

-Caviar (1990 Krug Champagne)

-Toro Sushi (ibid)

-Fois Gras (1988 Chateau d’Yquem)

-Bacon & Egg Salad (like the one at Bistro Vendome; no wine)

-Poached Lobster (1995 Boillot Chevalier Montrachet)

-Truffled Roast Chicken (1969 LaTache)

-Stilton (1963 Fonseca Vintage Port)

-Rhum Baba

Dish

Really good bread

Really good butter

Red Burgundy

Breakfast

Fresh squeezed Texas ruby red grapefruit from the Four Seasons in Austin

Logan biscuits (proprietary recipe)

Genny’s cheese grits

Country ham

Scrambled eggs

Fresh-cut Maui Gold pineapple

Lunch(es)

1 - cheese pizza from Arturo’s

2 - chili dog with Dr. Pepper from the Dublin, Texas plant

3 - pastrami sandwich from 2nd Avenue Deli or Katz’s with a Dr. Brown’s Black Cherry soda

Dale - Minestrone - no particular recipie, in fact the more kitchen sink the better (no red kidney beans) and a couple bottles of something white.
Hand rolled pasta with potatoes, green beans and fresh pesto sauce.
Osso Buco - the full leg bone in with large vegetable pieces and a Super Tuscan - or - striped sea bass baked in a salt dome with sage rosemary and a little lemon.
No desert. Just a glass of Port and the little cookies and chocolates.

Dish - Beef stew.

Thu Oct 30

Top Movies - Dale’s List

All time favorite franchise - James Bond.  22 films spanning four decades and 50+ Bond Girls (who could forget Honey Ryder, Plenty O’Toole, Pussy Galore, Dr. Christmas Jones, Xenia Onatopp, to mention just a few).  Now that is a franchise.  Favorites - From Russia With Love - 1963, You Only Live Twice - 1967, The Spy Who Loved Me - 1977, Tomorrow Never Dies - 1987 and Casino Royale - 2006.

  • Comedies… in no particular order.
    • Caddy Shack - 1980
    • Mash - 1970
    • Blazing Saddles – 1974
    • Airplane – 1980
    • The Birdcage – 1996
    • National Lampoons Animal House – 1978
    • Wedding Crashers - 2005
    • Spaceballs 1987
    • National Lampoons Vacation – 1983 and Christmas Vacation – 1989
    • There’s Something About Mary – 199
  • Others… in no particular order.
    • A River Runs Through It - 1992
    • Star Wars – 1977
    • Raiders of the Lost Ark – 1981
    • Rocky – 1986
    • Dirty Harry – 1971
    • The Dirty Dozen – 1967
    • Butch Cassidy and the Sundance Kid – 1969
    • Platoon – 1986
    • Unforgiven – 1992
    • Breaking Away - 1979
  1.  

    Mon Oct 6
    Sun Sep 21

    Wines - 9-20

    • 2006 Deidedheimer Paradiesgarten
      Reisling Kabinet Pfalz
    • 2006 Josef Rosch
      Reisling
    • 2006 Chessman Vineyards
      Mendocina County Pinot Noir
    • 2006 Gentil Hugel
      Alsace
    • 2005 Domaine Zind Humbrecht
      Pinot D’Alsace

    Sat Sep 20
    The Madsen’s table

    The Madsen’s table

    Fri Sep 19
    Soup’s cookin!

    Soup’s cookin!

    Thu Sep 18

    Saturday’s Menu

    1. Dates stuffed w/Goat Cheese
    2. Riesling Onion Soup
    3. Roast Sweet Onion Pork Tenderloin with Potato & Apple Gallette
    4. Pear Salad w/Artisanal Cheeses
    5. Apple Charlottes

    Mon Sep 8

    Best Food Experiences - Restaurant Edition

    Scott’s list:

    • Arturo’s
      NYC, January-98
      HS buddies
      Amazing day walking around the West Village, saw Bob Dylan Movie, went to Arturo’s for pizza. Great day, great pizza.
    • Bartolotta Ristorante di Mare
      Wynn Las Vegas, January-06
      Closing Dinner
      Fun group, but unbelievable cheese ravioli, a spectacular $700 fish, and more bottles of Brunello di Montalcino than I can remember.
    • Chinois
      Sant Monica, pick a time
      Genny
      Everytime we go we gush. Great Lobster, duck fried rice, haricot verts and wine.
    • Convenience Store
      Melbourne, FL, Some time in the 80’s
      Pa
      Sausage Biscuit with my grandfather, one of his favorite foods.
    • Frasca
      Boulder, December-07
      Supper Club
      Travis said it best. A night with great friends, a great meal, and an unbelievably beautiful snow fall.
    • Le Prevote
      Isle sur la Sorgue, France, July-98
      Genny, Mom & Dad, Halsteads
      Gorgeous patio with view of Sorgue River, unbelievable setting. Food and wine was good, but memory was the setting.
    • Le Timgad
      Paris, July-98
      Genny
      We were in Paris for about 36 hours. Went here for great Moroccan food; unique experience.
    • Nobu 57
      NYC, October-07
      solo
      Sunday night alone in midtown, decided to finally stop in and see what the fuss was about. First Otoro; best Otoro; best thing ever.
    • Peter Luger
      NYC, August-97
      ING Crew
      First time at Luger’s; best time at Luger’s; best steak I ever had.

    Best Food Experiences - Restaurant Edition

    Dale’s Contribution

    • Carini’s Pizzeria & Ristorante

    You will never in your life have a better grinder - eggplant, meatball and Italian sausage.  www.carinisristorante.com

    • Il Ritrovo

    Amazing… no fabulous… no supercalifragilisticexpialidocious antipasti bar and view of the Amalfi Coast.  www.ilritrovo.com

    • Grand Hotel Tremezzo Palace

    Actually, the little terrace restaurant ran by a couple of older ladies served the best Minestrone soup I have ever had.  www.tremezzopalace.com

    • Santi

    A great place to explore petit syrah.  www.tavernasanti.com

    • Castle Hill Inn & Resort

    Best meal bar none without the kids.  www.castlehillinn.com

    • Anthony’s Seafood

    You won’t believe the Rhode Island clam chowder.  www.anthonysseafood.com

    • Pappas Restaurants

    Best chain of all time.  www.pappas.com

    • Any old ice house in Houston 

    Cold beer and crawfish served on paper… that is livin.  www.bigjohnsicehouse.com

    • Dunton Hot Springs

    Best first class meal in Butch Cassidy’s watering hole.  www.duntonhotsprings.com

    • Lon’s at the Hermosa Inn

    Saved a guy’s life here once… food was outstanding.  www.hermosainn.com

    • Frank Pepe Pizzeria Napolentana

    Your mouth waters just thinking of it.  www.pepespizzeria.com

    • Gordon Ramsay at Claridge’s

    Most expensive meal and wine I have ever had: score 8.5.  www.gordonramsay.com/claridges

    • The Bubble Room

    Quite an impression… you have to see it once in your lifetime.  www.bubbleroomrestaurant.com

    A road trip that I would love to do - any takers?
    • Gary Danko or Michael Mina

    The next time I am in San Francisco.  www.garydanko.com www.michaelmina.net/michaelmina_sanfrancisco

    Local Favorites Befitting My Persona