what’s surprising?

A com­mon com­plaint of research is that it’s not “sur­pris­ing.” For exam­ple, a reviewer might say, “The study was well done, but the results weren’t really that sur­pris­ing.”, or, “I found the results a bit predictable.”

But what do these state­ments really mean? Do they mean, “Had you asked me the research ques­tion, I could have guessed the results with some degree of con­fi­dence.”? Or, “If you asked your research ques­tion of 100 experts, 95 of their guesses would have been right.”?

Maybe we might intend for them to mean that, but they don’t actu­ally cap­ture what hap­pens when a reviewer reads a paper. What usu­ally hap­pens is the reviewer reads the research ques­tion and thinks, “Hm, I could guess, but I’m not sure.” Then, upon read­ing the results, the reviewer thinks, “Well of course, that’s not sur­pris­ing at all.” The test exe­cuted here is not whether an expert can con­fi­dently pre­dict the answer to a research ques­tion, but whether in hind­sight it seems plau­si­ble that an expert could have guessed the result.

In this sense, what makes a result “sur­pris­ing” has less to do with what we know as sci­en­tists and more to do with what we think we know about what other researchers know.

This social fab­ric that appar­ently under­lies our judge­ments of what is known has other inter­est­ing effects on what is accepted as advanc­ing knowl­edge. For exam­ple, that some find­ing has not been pub­lished, is rarely a sat­is­fac­tory argu­ment for why some­thing should be pub­lished. What under­lies this belief it that it is not our goal as sci­en­tists to doc­u­ment every­thing that we know. Instead, it is our job to doc­u­ment the sub­set of what we know that is inter­est­ing, impor­tant, and surprising.

But aren’t most judge­ments of what is inter­est­ing and impor­tant are grounded in the present? How are we to know what is inter­est­ing or impor­tant in the future? Who are we to judge that the future of human­ity will find no inter­est in the unin­ter­est­ing, unim­por­tant results of today? Take, for exam­ple, a recent review I wrote on a paper about using mul­ti­touch, table­top dis­plays for engi­neer­ing design. I argued that it was unclear what prob­lem was being solved. But what if it solves a prob­lem that doesn’t exist yet? Or what if it solves it in such a way that another prob­lem I hadn’t even thought of becomes triv­ial? On what basis could I really judge whether the work would have future worth?

All of this makes me think I don’t give papers a fair shake. Maybe I’ll adopt a new review­ing pro­to­col: instead of read­ing the paper straight through and record­ing my thoughts, I’ll look at the authors’ research ques­tion and try to answer it myself for five min­utes. Then, I’ll read the paper and if they came up with a dif­fer­ent solu­tion or answer that mine (that is of course reli­able, sound, etc.), whether or not I’m sur­prised, the authors get credit for dis­cov­er­ing or invent­ing some­thing that I didn’t know. Of course, If I guessed their results or solu­tion in a mere five min­utes, what could they pos­si­bly have contributed?

5 thoughts on “what’s surprising?

  1. Hi Andy! This is a funny serendip­ity, as I was just think­ing about this prob­lem you men­tion: “But aren’t most judge­ments of what is inter­est­ing and impor­tant are grounded in the present? How are we to know what is inter­est­ing or impor­tant in the future? Who are we to judge that the future of human­ity will find no inter­est in the unin­ter­est­ing, unim­por­tant results of today?”

    I think the rec­i­p­ro­cal can be true of the our views on the past as well. I recently saw a rather ridicu­lous show on the His­tory chan­nel called “Ancient Aliens”, whose entire premise seemed to be founded on the assump­tion that all ancient knowl­edge must be sub­sumed by our present knowl­edge, so if the ancients appear to have known things that we can­not fig­ure out, they must have had divine or extrater­res­trial help. I wrote about this on my blog: http://www.apieceofeverything.com/blog/index.php/2010/04/27/the-wisdom-of-the-ancients/

    But I feel your pain, as most of my research work has been in rel­a­tively un-groundbreaking ter­ri­tory, caus­ing me to receive a lot of reviews to the effect of your “unsur­pris­ing” ones. It seems a haz­ard of doing research in tech­nol­ogy in the inter­net age. Peo­ple in HCI-oriented con­fer­ences want to pub­lish the next desk­top metaphor replace­ment or real­iza­tions of scifi inter­faces from ILM movies, not solid empir­i­cal inves­ti­ga­tions into (for exam­ple and in my case) the effects of hand­writ­ing recog­ni­tion accu­racy on user sat­is­fac­tion for var­i­ous tasks, which is seen as stodgy and boring.

    *Lisa

    • Thanks for your com­ment, Lisa. I haven’t thought about this much lately, but I have a bunch of VL/HCC and UIST reviews to do, so it’s com­ing up again. I do think we should think about the “stodgy and bor­ing” dimen­sion dif­fer­ently than the “sur­pris­ing” dimen­sion. There exist stud­ies that find things that were never known, but for whom no one cares (this makes me think of the pub­lic views of biol­o­gists cat­a­loging every pos­si­ble thing about some species of rain for­est toad). There also exist stud­ies about really impor­tant things that aren’t too sur­pris­ing (head­line: peo­ple still have a hard time quit­ting smok­ing). I guess most of this debate is about what we want to archive. These days, with infi­nite stor­age, why don’t we just archive every­thing that’s well writ­ten and well exe­cuted and flag the things we find sur­pris­ing and important?

  2. Another prob­lem of fil­ter­ing papers by the unex­pect­ed­ness of their find­ings is that it dis­cour­ages repli­ca­tion. If the study a paper reports is very well exe­cuted, and the paper itself is clear and well argued, and the topic under study could still use some fur­ther explo­ration (that is, the paper isn’t just con­firm­ing some­thing the com­mu­nity already takes as a fact), then I vote for accept­ing the paper. I do con­sider the ele­ment of sur­prise in my reviews, though, and a sur­pris­ing result that meets the rest of these cri­te­ria is cer­tainly a strong accept to me.

    I do like your pro­posal to give some thought to the research ques­tions in advance. I know of at least one per­son that does some­thing sim­i­lar (he writes his pre­dicted answers along­side the ques­tions), and this helps him focus his read­ing and, later on, his critique.

  3. Instead, it is our job to doc­u­ment the sub­set of what we know that is inter­est­ing, impor­tant, and surprising.”

    Not sure I agree. The cri­te­ria of inter­est­ing­ness or nov­elty might make a use­ful fil­ter for a con­fer­ence com­mit­tee — to ensure the pro­gram is of inter­est — but pre­sum­ably, just because you can ‘guess’ what the results are, isn’t the same as empir­i­cally val­i­dat­ing them. I mean, it isn’t sci­ence just because you have a hunch about the outcome.

    Let’s look at Fitts’s Law, which states that the big­ger and closer a UI ele­ment is, the quicker it is to nav­i­gate to. Seems unsur­pris­ing to me, kinda like say­ing the big­ger the hole, the eas­ier it is to get a ball through it. But I would expect Fitts’s Law to be ‘proven’ valid in the pos­i­tivist sense, and I think a paper show­ing this would be a great con­tri­bu­tion. Of course, the fifth or sixth paper show­ing the same thing, but in a slightly dif­fer­ent con­text, may not be as impor­tant, and may not merit pub­li­ca­tion (although this is appar­ently untrue, judg­ing by the wikipedia entry!)

    I think the same impor­tance about empiri­cism is valid in any research, qual­i­ta­tive and quan­ti­ta­tive. One of the prob­lems with com­puter sci­ence, in my view, is just this insis­tence on nov­elty and inter­est­ing­ness. Of course the great papers show some­thing unex­pected, but sci­ence also advances in incre­men­tal steps.

    • I com­plete agree with you; I was imply­ing that as a com­mu­nity, computing-related researchers have set­tled on these as cri­te­ria. I don’t per­son­ally think they’re always help­ful criteria.

      I think that some­times, our insis­tence on nov­elty leads us to focus on work that is nar­row, but exceed­ingly novel. There ought to always be space for incre­men­tal work about big problems.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="" highlight="">