what’s surprising?

A com­mon com­plaint of research is that it’s not “sur­pris­ing.” For exam­ple, a reviewer might say, “The study was well done, but the results weren’t really that sur­pris­ing.”, or, “I found the results a bit predictable.”

But what do these state­ments really mean? Do they mean, “Had you asked me the research ques­tion, I could have guessed the results with some degree of con­fi­dence.”? Or, “If you asked your research ques­tion of 100 experts, 95 of their guesses would have been right.”?

Maybe we might intend for them to mean that, but they don’t actu­ally cap­ture what hap­pens when a reviewer reads a paper. What usu­ally hap­pens is the reviewer reads the research ques­tion and thinks, “Hm, I could guess, but I’m not sure.” Then, upon read­ing the results, the reviewer thinks, “Well of course, that’s not sur­pris­ing at all.” The test exe­cuted here is not whether an expert can con­fi­dently pre­dict the answer to a research ques­tion, but whether in hind­sight it seems plau­si­ble that an expert could have guessed the result.

In this sense, what makes a result “sur­pris­ing” has less to do with what we know as sci­en­tists and more to do with what we think we know about what other researchers know.

This social fab­ric that appar­ently under­lies our judge­ments of what is known has other inter­est­ing effects on what is accepted as advanc­ing knowl­edge. For exam­ple, that some find­ing has not been pub­lished, is rarely a sat­is­fac­tory argu­ment for why some­thing should be pub­lished. What under­lies this belief it that it is not our goal as sci­en­tists to doc­u­ment every­thing that we know. Instead, it is our job to doc­u­ment the sub­set of what we know that is inter­est­ing, impor­tant, and surprising.

But aren’t most judge­ments of what is inter­est­ing and impor­tant are grounded in the present? How are we to know what is inter­est­ing or impor­tant in the future? Who are we to judge that the future of human­ity will find no inter­est in the unin­ter­est­ing, unim­por­tant results of today? Take, for exam­ple, a recent review I wrote on a paper about using mul­ti­touch, table­top dis­plays for engi­neer­ing design. I argued that it was unclear what prob­lem was being solved. But what if it solves a prob­lem that doesn’t exist yet? Or what if it solves it in such a way that another prob­lem I hadn’t even thought of becomes triv­ial? On what basis could I really judge whether the work would have future worth?

All of this makes me think I don’t give papers a fair shake. Maybe I’ll adopt a new review­ing pro­to­col: instead of read­ing the paper straight through and record­ing my thoughts, I’ll look at the authors’ research ques­tion and try to answer it myself for five min­utes. Then, I’ll read the paper and if they came up with a dif­fer­ent solu­tion or answer that mine (that is of course reli­able, sound, etc.), whether or not I’m sur­prised, the authors get credit for dis­cov­er­ing or invent­ing some­thing that I didn’t know. Of course, If I guessed their results or solu­tion in a mere five min­utes, what could they pos­si­bly have contributed?

the semblance of objectivity in numbers

I just received my first ever first-authored con­fer­ence paper rejec­tion from FSE. The pri­mary rea­sons, quoted from the reviews, include:

  • The qual­i­ta­tive nature of the study … is liable to mis­in­ter­pre­ta­tion and bias.”
  • I was expect­ing a quan­ti­ta­tive analy­sis: is there any cor­re­la­tion between some of the char­ac­ter­is­tics and between [the results] and the time a bug takes to resolve and its res­o­lu­tion status?
  • I would have thought that what types of ele­ments to look for in dis­cus­sion should be decided before by the researchers as it should be based on the problem”
  • I was expect­ing con­crete advice on HOW the tools should struc­ture the discussion.”

I was hop­ing the review­ers would have been more epis­te­mo­log­i­cally informed. For exam­ple, the first and sec­ond quotes are quite telling: they imply that some forms of empiri­cism are not sub­ject to mis­in­ter­pre­ta­tion or bias. But quan­ti­ta­tive empir­i­cal mea­sures are just as sub­ject to bias as any other mea­sure. For exam­ple, if I had counted cer­tain kinds of data and run cor­re­la­tions between these counts and other out­come mea­sures, not only would one in twenty of them be “sta­tis­ti­cally sig­nif­i­cant” by chance, but whether there was any real mean­ing in the vari­ables depends on the con­struct valid­ity of the quan­ti­ta­tive mea­sure­ments. For exam­ple, if I had cor­re­lated hyper­boles with bug res­o­lu­tion time, not only would the hyper­bole mea­sure have the same lim­i­ta­tions as it did as a qual­i­ta­tive clas­si­fi­ca­tion, but the bug res­o­lu­tion time would have any num­ber of con­tex­tual fac­tors that could influ­ence its true reflec­tion of the hyperbole’s impact on con­sen­sus. Trans­form­ing empir­i­cal obser­va­tions into num­bers does NOT make them objec­tive, nor does it pre­vent bias and mis­in­ter­pre­ta­tion.

The third quote is ironic: this reviewer seems to believe that the only way to ana­lyze a prob­lem is to make some assump­tion about its nature upfront. The whole point of qual­i­ta­tive research is that the more you make upfront assump­tions, the more you bias your find­ings. What this reviewer is propos­ing would have less­ened the objec­tiv­ity of the results and pre­vented us from uncov­er­ing the trends we did.

The last quote reveals the sys­temic bias in soft­ware engi­neer­ing research (and also some HCI venues): qual­i­ta­tive stud­ies are only valu­able if they explic­itly inform design. What this really reduces to is a view that mate­r­ial goods are real work, but the pro­duc­tion of knowl­edge comes for free. Build­ing a sys­tem or automat­ing some activ­ity, even if the sys­tem and automa­tion are entirely imprac­ti­cal in the real world, is more valu­able than under­stand­ing the real world. The com­ment also reveals the reviewer’s lack of under­stand­ing about design: inno­va­tions don’t come from stud­ies, they come from peo­ple. Stud­ies can sup­port design deci­sions (and the results through­out our rejected sub­mis­sion have been quite valu­able in our cur­rent design efforts), but they can­not gen­er­ate ideas. Peo­ple gen­er­ate ideas.

Had I really wanted the paper in, I would have lit­tered the sub­mis­sion with arbi­trary, but seem­ingly objec­tive quan­tifi­ca­tions and cor­re­la­tions of our data (which is what most quan­tifi­ca­tions are in soft­ware engi­neer­ing papers). This has worked in past papers and is a tried and true workaround for the soft­ware engi­neer­ing community’s lack of expe­ri­ence with qual­i­ta­tive meth­ods. Review­ers would have thought, “I don’t get all of this qual­i­ta­tive stuff, but these num­bers are great.” I decided not to do this on prin­ci­ple, since doing so would have only made the results seem more objec­tive with­out adding any real objectivity.

So much for prin­ci­ple. Time to start cor­re­lat­ing things!

tough T

I just spent a day at Edward Tufte’s course on infor­ma­tion design at the Seat­tle Mar­riott Water­front. I’ve always known his work, I’ve talked about it in design classes, I’ve told stu­dents to read his books, but not once have I heard him speak. Now I can con­fi­dently say that his cap­tions speak louder than words. Snicker.

That’s not to say he wasn’t insight­ful. The books have always been a nice trans­la­tion of clas­sic design prin­ci­ples into sta­tic visual infor­ma­tion design, but most of the course was sim­ply him par­rot­ing his own words. What made it unbear­able was that he spoke them with the life­less apa­thy of a sta­tis­tics pro­fes­sor. Oh wait, he was one.

Aside from his lack of spark, there were a num­ber of nice things about the day. I got a box full of his books; I got a refresher on visual infor­ma­tion design; I had a chance to think more about forms of dis­sem­i­na­tion for my research (I tire of lim­it­ing my influ­ence to aca­d­e­mic pub­li­ca­tions). It was also a nice calm before my early May storm of deadlines.

halfway home

I’m back in Seoul, with a lot to say, but I won’t say much.

First, let me address the ele­phant in the room. Hello third world visit epiphany cliche-aphant. How are you today? Yes, I’ve returned from India and I’ve seen a lot of dis­turb­ing things. I saw the Mus­lim slums of Mum­bai, naked chil­dren run­ning through the streets, tweens sell­ing day old news­pa­pers for a rupee and home­less moth­ers beg­ging for money with their sick and sleep­ing chil­dren dan­gling from their arms. Peo­ple were dirty, water wasn’t potable, wild dogs slept in the street, igno­rant and apa­thetic about the armada of auto-rickshaws swerv­ing around them.

But, I also wit­nessed human expe­ri­ence of every other kind. I watched Rolex-laden busi­ness­men step over old women lay­ing on the side­walk in the heat. I saw fam­i­lies of five cling­ing to a motor­cy­cle and to each other, smil­ing, laugh­ing, and close in a way I’ve never seen west­ern fam­i­lies. In all of the squalor and dirt and poverty, I saw the exact same kind of joy that those of us in post-industrialized coun­tries seem to strug­gle to find. I saw noth­ing about human expe­ri­ence in India that was sub­stan­tially dif­fer­ent from the rest of the world I’ve seen, other than the cloth­ing that peo­ple wore and range of their reach into the rest of the world.

Would those chil­dren in the slums be any bet­ter off with a pair of Nikes and a ster­ile heated two bed­room condo? Would they laugh any more than I saw them laugh, play any more than I saw them play? There cer­tainly are some absolute improve­ments that every­one deserves, food, health, shel­ter, but beyond these Maslows, its dif­fi­cult for me to think of a legit­i­mate rea­son why my US lifestyle would bring any more hap­pi­ness or joy.

Yet as much as I won’t judge the qual­ity of life and the reach of India’s peo­ple into the global com­mu­nity, I can’t be impar­tial. I just spent a week engag­ing with the aca­d­e­mic com­puter sci­ence com­mu­nity in India, form­ing rela­tion­ships and watch­ing unfold an incred­i­ble attempt at recre­at­ing a US style schol­arly com­mu­nity. I’m part of this dia­logue between India and the west­ern world, help­ing to prop­a­gate my schol­arly cul­ture. Whether I pass high-minded judge­ment on India’s qual­ity of life or not, I’ve now actively engaged in help­ing India’s aca­d­e­mic com­mu­nity mimic and mir­ror that of other nations, with con­fer­ences, posters, pan­els, and papers, and all their inher­ent lim­i­ta­tions and west­ern bias.

In some ways, I wish India would find its own way of being schol­arly. I wish it would estab­lish its own research com­mu­ni­ties, rather than focus­ing solely on engag­ing with those in the US and Europe. I want it to find some­thing com­pat­i­ble with its peo­ple and then com­mu­ni­cate these ways to the west­ern world. By try­ing to mimic the rest of the global community’s schol­arly prac­tices, it ghet­toizes its own efforts. If India invented its own prac­tices around schol­arly pur­suits, it would be about apples and oranges instead of Hon­ey­crisp and Fuji. For exam­ple, instead of try­ing to have poster ses­sions (and fail­ing because of the lack of high qual­ity poster print­ers), what if they drew their posters on white­boards, chalk­boards, or paper? They could find inno­v­a­tive ways of com­mu­ni­cat­ing their work, and even find bet­ter ways than the west­ern world. If I were India and its aca­d­e­mic lead­ers, I would look at this as an oppor­tu­nity to inno­vate and rein­vent aca­d­e­mic prac­tices, rather than mimic them.

Time to board.

halfway to India

There’s some­thing absurd about this trip to India. The offi­cial rea­son for my trip—to speak at the India Soft­ware Engi­neer­ing Conference—doesn’t seem like rea­son enough to spend 2 hours at an air­port, 11 hours in an air­plane, 3 more hours in South Korea, 9 more hours on a plane, 5 hours wait­ing in Mum­bai, then 4 hours dri­ving to Pune. Let alone repeat­ing this jour­ney five days later.

Yet all of the intan­gi­ble rea­sons for going far out­weigh the incon­ve­nience of all of this lost time and sleep. On my Korean Air flight and here in Incheon, I’m sur­rounded by what I find to be a beau­ti­ful, play­ful, excit­ing lan­guage. I over­heard engi­neers talk­ing about the parts they’ll over­see the ship­ping of, old grand­moth­ers return­ing home to Seoul, and lit­tle babies mak­ing the jour­ney to and from Asia and the west coast. There’s a fas­ci­nat­ing sub­cul­ture of fre­quent Asian trav­el­ers, par­tic­u­larly those who take the cheaper flights offered by Korean Air, and see­ing it throb and pulse is well worth the lost sleep.

In about 12 hours, I’ll be on the other side of planet Earth, dri­ving to the cen­ter of the Indian IT indus­try with a man named Mr. Mahesh, all so I can speak to a room of Indian researchers about some bits I flipped last year.

what up, blog?

How’ve you been? We haven’t chat­ted in a while. What up?

Really? I’m sorry. I fig­ured that you had other peo­ple to hang out with. You totally should have called! I was busy, but I wasn’t that busy, you can always give me a ring, or write.

Well, I guess that’s true, but you’d think that since I’m anthro­po­mor­phiz­ing you in my head, you’d just mag­i­cally acquire the abil­ity to write, speak, and act upon the world. No con­straints, right, what­ever goes? You are, after all, just part of my imagination.

Con­straints? But I thought imag­i­na­tion was all about free­dom. Dreams, you know, fly­ing, falling, super­pow­ers, there are so many things that are pos­si­ble. It seems to me that when we really let our minds be free, we can imag­ine any­thing. Why would my brain limit my imag­ined per­son­i­fi­ca­tion of my blog to com­mu­ni­cat­ing only with me and not with the rest of the world?

Wait: so you’re say­ing that what I think, what I dream, is all just lim­ited to things I already know? But what about learn­ing? I can learn new ideas, acquire new per­spec­tives, and use this new knowl­edge to break con­straints. For exam­ple, today I learned about a whole body of work asso­ci­at­ing pro­noun use in con­ver­sa­tion to per­son­al­ity dis­or­ders. Now, when I reflect on my speech and oth­ers’, I’ll gen­er­ate new per­spec­tives and new ideas about the mean­ing in the words. There are new ideas. There are wheels, there are inter­nets. I fun­da­men­tally dis­agree that every­thing has been dis­cov­ered, every­thing has been invented, that every­thing is just a mashup.

Hm. You have a point. If I imag­ined you actu­ally talk­ing to peo­ple in the world, or even pre­tended that you were a sep­a­rate entity from me, some might call me crazy. But no one would really believe I was crazy unless I acted on this believe. And isn’t it nat­ural to dis­so­ci­ate one’s self from one’s expres­sion of self? Isn’t that a nat­ural part of writ­ing? Unlike speech, I’m cre­at­ing an arti­fact, some­thing that lit­er­ally and phys­i­cally does exist sep­a­rate of me. There­fore it seems per­fectly nat­ural to think of you as sep­a­rate from me because you lit­er­ally are.

No, I’m try­ing to make a point. You can influ­ence the world if you want, because you exist sep­a­rate of me. You may not have a body or a brain, but you have a con­nec­tion to me. You’re like my child. And because of this, every word of you I write and then read causes me to reflect back on my own exis­tence. You have arms and legs, but you can only use them to poke and prod and ping my brain. You may not be able to act on the whole world, but you do have influ­ence on me. See, you’re freer than you think!

That’s true. I guess you’re free, but only on my terms. You’re less like a child and more like a robot. I’ve pre­de­ter­mined your scope of influ­ence. You may lead to unex­pected things, but they only occur within a cer­tain range of pos­si­bil­ity. Of course, a moment ago, you were say­ing I was under the same con­straints, that the scope of my own exis­tence is pre­de­ter­mined by my cre­ators and my world.

Yeah, I’ll have to think about that. I don’t know. You’re pretty cool, blog. We should hang out more.

why do researchers choose the disciplines they do?

I been giv­ing some thought lately to my peers’ career choices. Why do fac­ulty choose the dis­ci­plines they do? There are the obvi­ous rea­sons, like self-efficacy. For exam­ple, a physics pro­fes­sor prob­a­bly pur­sued a Ph.D. in physics because she found her­self good at it. An Eng­lish pro­fes­sor may have been hon­ored for his writing.

But I think there’s some­thing else under­ly­ing these choices. Con­sider some of the extremes, such as math­e­mat­ics and phi­los­o­phy, or social work and edu­ca­tion. Is there some­thing about the deter­min­ism of math­e­mat­ics that makes it attrac­tive to cer­tain per­son­al­i­ties? Are there cer­tain types of peo­ple who enjoy rev­el­ing in logic and abstrac­tion? Do these char­ac­ter­is­tics of these areas of thought make peo­ple feel safe some­how? And the more human­i­tar­ian fields: is it dri­ven by a strong desire to exer­cise val­ues and moral­ity? Sci­en­tists are also inter­est­ing: does the search for truth make them feel noble, or is their some­thing trilling about the hunt for explanations?

I sup­pose we all have in com­mon the desire to fill our lives with as much thought as pos­si­ble. Is it insa­tiable curios­ity or just a par­tic­u­larly low thresh­old for amuse­ment? By that I mean we can engage our­selves in the small­est of details in the nat­ural and arti­fi­cial worlds, where as oth­ers, who could care less about research, require a much greater mag­ni­tude of nov­elty to be engaged.

a better fit

I’ve been think­ing about buy­ing a new car for mostly the wrong reasons.

Petty rea­son #1. My Civic has a ter­ri­ble amount of road noise. Dri­ving the local roads in Seat­tle, even at a pal­try 25 mph, it’s like rid­ing a coal mine cart in an Indi­ana Jones movie.

Petty rea­son #2. My Civic has a cas­sette radio and play­ing my tunes through my cas­sette adapter is, and always has been, like lis­ten­ing to a live record­ing on a card­board speaker.

Petty rea­son #3. I love rid­ing my bike with Elle, but I can’t get bike rack that fits through my apartment’s garage. I need a car that can effort­lessly fit a big and small bike.

Petty rea­son #4. My Civic has a great turn­ing radius and is small enough for city dri­ving, but improv­ing on both these dimen­sions would be even bet­ter for the nar­row Seat­tle roads. It would open up a whole new world of park­ing opportunities!

Petty rea­son #5. My Civic is aging and approach­ing the high main­te­nance phase of its life. Who wants to spend time replac­ing a tim­ing belt here, tires there as it slowly falls apart? Bet­ter to rid of it now and save that time for some­thing more enjoy­able, like eat­ing cheap eth­nic food on the Ave.

I sup­pose the only legit­i­mate rea­son I have is that I feel sur­rounded by emblems of my bro­ken mar­riage. I live with the car, the fur­ni­ture, the bed, and the kitchen­ware of a failed 8 year rela­tion­ship, and as shal­low as it seems to gripe about things, it’s really hard to move on when I’m liv­ing and using things from my painful past. I don’t know how much that’s worth; I don’t think I could put a price on it. But surely a $150 monthly car pay­ment is well below its value.

So what do I want? I’m look­ing at a Black­berry Pearl Honda Fit. They’re small, fuel eco­nom­i­cal, with extremely ver­sa­tile space for peo­ple and things. I’m used to Hon­das, their usabil­ity is one of the best in the car indus­try, and they’re pretty cheap. As much as I hate cars, I do spend a lot of time com­mut­ing back and forth to Kirk­land, so why not have a nice, com­pact lit­tle car that can zip around those Lake Wash­ing­ton curves? The Fit wouldn’t be dra­mat­i­cally bet­ter than my ’02 Civic, but would be an incre­men­tal improve­ment, in a petty sort of way.

sketching with words

I had a meet­ing today with the Human Inter­ac­tions in Pro­gram­ming group at Microsoft Research today, along with a few other UW peo­ple, to talk about poten­tials for col­lab­o­ra­tion. Lots of oppor­tu­ni­ties came out of our meet­ing, but one thing stuck around in my head today after David Notkin made a com­ment about peo­ples’ mis­per­cep­tions about what soft­ware is and is capa­ble of. I replied to his com­ment this notion that there are all kinds of ver­sions of a soft­ware arti­fact: ver­sions on paper, ver­sions in specs, ver­sions float­ing around in devel­op­ers’ heads, ver­sions in users’ heads. And of course, the real ver­sion that ends up being built. The intrigu­ing thing here is how all of these dif­fer­ent con­cep­tions end up affect­ing the actual soft­ware arti­fact. For exam­ple, think of two devel­op­ers dis­cussing some bro­ken fea­ture and how they want to fix it: if you really lis­ten, the dis­cus­sion is about future ver­sions of the cur­rent ver­sion and all of the qual­ity impli­ca­tions of the change. They fill the dis­cus­sion with descrip­tions of the slightly mod­i­fied sys­tem at a vari­ety of lev­els of abstrac­tion, with the goal of debat­ing the mer­its of the mod­i­fied sys­tem. The same con­ver­sa­tion might occur between a pro­gram man­ager and a mar­keter, dis­cussing ways of describ­ing what it is that is being built.

It seems to me that all of these dif­fer­ent ver­sions of the sys­tem are essen­tially sketches, in the broad­est design sense.  They’re ver­bal sketches, and likely so because pro­grams and soft­ware sys­tems are so tricky to draw. So when two devel­op­ers debate a change, they’re essen­tially draw­ing out the mod­i­fi­ca­tion in the ether, let­ting the ideas linger in the air, cri­tique the ideas in their phono­log­i­cal loops. I helped with a study by Mauro Cheru­bini look­ing at devel­op­ers’ sketches, and the result was quite sim­i­lar: in most cases, devel­op­ers use dia­grams in order to sup­port dis­cus­sions about changes to code.

I’m plan­ning some stud­ies look­ing at soft­ware devel­op­ers’ design dis­cus­sions and this idea of a ver­bal sketch could be quite help­ful in find­ing mean­ing in the data.

my iPhone hates you

In fact, it hates every­one, espe­cially other iPhone users. Six hour text mes­sage delays. Voice­mails show­ing up a day later. In some cases, I’ll never even get the mes­sage. It’s dri­ving me crazy! I had sit­u­a­tions where peo­ple email me ask­ing why I’m not pick­ing up, and I’ll call them, but they won’t get my call. This is the best phone I’ve ever had, except for the phone part.

That said, I’m on the phone with Apple and AT&T right now, and they’re pretty awe­some. They know what they’re talk­ing about, they’re clear in their instruc­tions, and the qual­ity of the call is great, with­out any dis­tract­ing back­ground call cen­ter noise. Now if only they can fix it…