Color Survey Results

Who in the rainbow can draw the line where the violet tint ends and the orange tint begins? Distinctly we see the difference of the colors, but where exactly does the one first blendingly enter into the other? So with sanity and insanity.
—Herman Melville, Billy Budd

Orange, red? I don’t know what to believe anymore!
—Anonymous, Color Survey

I WILL EAT YOUR HEART WITH A FUCKING SPOON IF YOU AKS ANY MORE QUESTIONS ABOUT COLORS
—Anonymous, Color Survey

Thank you so much for all the help on the color survey.  Over five million colors were named across 222,500 user sessions.  If you never got around to taking it, it’s too late to contribute any data, but if you want you can see how it worked and take it for fun here.

First, a few basic discoveries:

  • If you ask people to name colors long enough, they go totally crazy.
  • “Puke” and “vomit” are totally real colors.
  • Colorblind people are more likely than non-colorblind people to type “fuck this” (or some variant) and quit in frustration.
  • Indigo was totally just added to the rainbow so it would have 7 colors and make that “ROY G. BIV” acronym work, just like you always suspected. It should really be ROY GBP, with maybe a C or T thrown in there between G and B depending on how the spectrum was converted to RGB.
  • A couple dozen people embedded SQL ‘drop table’ statements in the color names. Nice try, kids.
  • Nobody can spell “fuchsia”.

Overall, the results were really cool and a lot of fun to analyze.  There are some basic limitations of this survey, which are discussed toward the bottom of this post.  But the sheer amount of data here is cool.

Sex

By a strange coincidence, the same night I first made the color survey public, the webcomic Doghouse Diaries put up this comic (which I altered slightly to fit in this blog, click for original):

It was funny, but I realized I could test whether it was accurate (as far as chromosomal sex goes, anyway, which we asked about because it’s tied to colorblindness) [Note: For more on this distinction, see my follow-up post]. After the survey closed, I generated a version of the Doghouse Diaries comic with actual data, using the most frequent color name for the handful of colors in the survey closest to the ones in the comic:

Basically, women were slightly more liberal with the modifiers, but otherwise they generally agreed (and some of the differences may be sampling noise).  The results were similar across the survey—men and women tended on average to call colors the same names.

So I was feeling pretty good about equality.  Then I decided to calculate the ‘most masculine’ and ‘most feminine’ colors.  I was looking for the color names most disproportionately popular among each group; that is, the names that the most women came up with compared to the fewest men (or vice versa).

Here are the color names most disproportionately popular among women:

  1. Dusty Teal
  2. Blush Pink
  3. Dusty Lavender
  4. Butter Yellow
  5. Dusky Rose

Okay, pretty flowery, certainly.  Kind of an incense-bomb-set-off-in-a-Bed-Bath-&-Beyond vibe.  Well, let’s take a look at the other list.

Here are the color names most disproportionately popular among men:

  1. Penis
  2. Gay
  3. WTF
  4. Dunno
  5. Baige

I … that’s not my typo in #5—the only actual color in the list really is a misspelling of “beige”.  And keep in mind, this is based on the number of unique people who answered the color, not the number of times they typed it.  This isn’t just the effect of a couple spammers. In fact, this is after the spamfilter.

I weep for my gender.  But, on to:

RGB Values

Here are RGB values for the first 48 out of about a thousand colors whose RGB values (across the average monitor, shown on a white background) I was able to pin down with a fairly high degree of precision:

The full table of 954 colors is here, also available as a text file here (I have no opinion about whether it should be used to build a new X11 rgb.txt except that seems like the transition would be a huge headache.)

The RGB value for a name is based on the location in the RGB color space where there was the highest frequency of responses choosing that name.  This was tricky to calculate.  I tried simple geometric means (conceptually flawed), a brute force survey of all potential center points (too slow), and fitting kernel density functions (math is hard). In the end, I used the average of a bunch of runs of a stochastic hillclimbing algorithm.  For mostly boring notes on my data handling for this list, see the comments at the bottom of the xkcd.com/color/rgb/ page.

Spelling and Spam

Spelling was an issue for a lot of users:

Now, you may notice that the correct spelling is missing.  This is because I can’t spell it either, and when running the analysis, used Google’s suggestion feature as a spellchecker:

A friend pointed out that to spell it right, you can think of it as “fuck-sia” (“fuch-sia”).

Misspellings aside, a lot of people spammed the database, but there were some decent filters in place.  I dropped out people who gave too many answers which weren’t colors used by many other people.  I also looked at the variation in hue; if people gave the same answer repeatedly for colors of wildly varying hue, I threw out all their results.  This mainly caught people who typed the same thing over and over.  Some were obviously using scripts; based on the filter’s certainty, the #1 spammer in the database was someone who named 2,400 colors—all with the same racial slur.

Map

Here’s a map of color boundaries for a particular part of the RGB cube.  The data here comes from a portion of the survey (1.5 million results) which sampled only this region and showed the colors against both black and white backgrounds.

The data for this chart is here (3.6 MB text file with each RGB triplet named).  Despite some requests, I’m not planning to make a poster of any of this, since it seems wrong to take advantage of all this volunteer effort for a profit; I just wanted to see what the results looked like.  You’re welcome to print one up yourself (huge copy here), but keep in mind that print color spaces are different from monitor ones.

Basic Issues

Of course, there are basic issues with this color survey.  People are primed by the colors they saw previously, which adds overall noise and some biases to the data (although it all seemed to even out in the end).  Moreover, monitors vary; RGB is not an absolute color space.  Fortunately, what I’m really interested in is what colors will look like on a typical monitors, so most of this data is across the sample of all non-colorblind users on all types of monitors (>90% LCD, roughly 6% CRT).

Color is a really fascinating topic, especially since we’re taught so many different and often contradictory ideas about rainbows, different primary colors, and frequencies of light. If you want to understand it better, you might try the neat introduction in Chapter 35 ofThe Feynman Lectures on Physics (Vol. 1), read Charles Poynton’s Color FAQ, or just peruse links from the Wikipedia article on color.  For the purposes of this survey, we’re working inside the RGB space of the average monitor, so this data is useful for picking and naming screen colors. And really, if you’re reading this blog, odds are you probably—like me—spend more time looking at a monitor than at the outdoors anyway.

Miscellaneous

Lastly, here are some assorted things people came up with while labeling colors:

Thank you so much to relsqui for writing the survey frontend, and to everyone else who sacrificed their eyeballs for this project.  If you have ideas and want to analyze these results further, I’ve posted the raw data as an SQLite dump here (84 MB .tar.gz file). It’s been anonymized, with IPs, URLs, and emails removed.  I also have GeoIP information; if you’d like to do geocorrelation of some kind, I’ll be providing a version of the data with basic region-level lat/long information (limited to protect privacy) sometime in the next few days. Note: The ColorDB data is the main survey.  The SatOnly data is the supplementary survey covering only the RGB faces in the map, and was presented on a half-black half-white background.)

And, of course, if you do anything fun with this data, I’d love to see the results—let me know at xkcd@xkcd.com.

1,287 replies on “Color Survey Results”

  1. In the RGB cube map, I find it interesting that Yellow and Cyan are right where they should be but Magenta is waaaay off (should be in the corner like the others). Don’t know what conclusion to draw from that. Perhaps monitors have trouble producing magenta faithfully. Perhaps nobody knows what magenta looks like.

    Like

  2. I’m not a medical doctor, but based on the brownish color of the “piss yellow”, I recommend that XKCD readers drink more water.

    Like

  3. I tear apart surveys apparently created half in jest and pretend to be more intelligent than my peers in order to stroke my ego until it’s spent and can’t stand erect anymore. Who am I?

    Like

  4. I love that women call pink the color that men call salmon. It’s like a scientific justification for all of those men who have yelled “It’s salmon!” in defense of their shirt.

    Like

  5. So, I’m not the only one who used “rusty orange”??? And is there actually a group of shades for which “teal” was the most common name?

    Like

  6. I… don’t really know why this is as awesome as it is. But it is hardcore awesome all the same.

    Like

  7. I can’t be the only person curious about what colours were named “Penis” and “Gay” the most, can I?

    Like

  8. Do you use the word “peruse” as it’s defined in the dictionary – to examine intently? Or as the culturally accepted definition – to skim? And how did that word get so messed up?

    Like

  9. The most powerful results of this fantastic study is that we know have populist definitions of the following colors:

    Color R G B
    shit yellow 188 174 43
    shit brown 131 94 29
    shit green 119 148 51

    Average values found from Like ‘shit% color%’, though variance was high (and absurd in some cases)

    Like

  10. This is good evidence that humans are sensitive to the presence of red. (Or is it just English-speaking humans?)

    On the red side of the colour cube, people will differentiate between purple, pink, magenta, red, maroon, orange, brown, yellow, mustard, gold, and olive.

    On the other side of the colour cube, you’ve got green, blue, and teal, and that’s it. (Oh, and cyan, thanks to nerds who are familiar with RGB.)

    Like

  11. This might just be my experience speaking, but I suspect the same demographic of men answering “Salmon” may be the same group that consistently comes to a party and insists that we play Halo: Combat Evolved instead of Halo 3.
    Granted, I’m the only one who ever chose Salmon.

    Like

  12. I LOVED this!
    I teach high school computer graphic design. I had my kids take it when we were studying color theory. I wanted them to see the correlation between color and real-life associations. We discussed different cultures and how, if a color is not named in a specific culture, then it is less likely to be recognized as a color (for example, there is a culture that does not recognize “purple” as a color, so purples are often called reds or blues). I am very excited to share these results with my students. Thank you!!!

    Like

  13. Randall – scientific advancement often waits for better ways to visualize concepts and data; I expect we will see your name in engineering textbooks eventually because you have quite a gift.

    Like

  14. As a color nerd, I have to say that this is awesome! (Yeah…I totally used to obsessively organize my 256 crayons by color when I was little…)

    And I totally agree with Lorkas about the “salmon-colored shirt”. My sister and I like to tease our father about his “pink” shirt which upsets him a lot.

    And related to colors…I have to share the following website. This is for those of you who ever wondered what color darkness is. http://simple.wikipedia.org/wiki/Darkness

    Like

  15. So it dawns on me that the colour most people label ‘magenta’ is nothing like the #ff00ff atrocity that it really is. How utterly weird!

    Like

  16. Not to be anal or anything – although there’s nothing wrong with being anal – it’s a free country afterall – at least last time I checked – oh, yeah, where was I, yeah ok…your RGB values are actually HEX values.

    Like

  17. what did you do with the colourblind results? did you just throw them out? I thought you were going to produce a map so that we can finally know what colours are what for them.

    or something.

    Like

  18. Can we get the regional IP address database? I’d like to try to hook that into the results. Thanks!

    Like

  19. When I was 26 – I’m 56 now – I was trying to write fiction, and discovered the simple act of describing the world around us was far from simple. What a colour was – the word for it, that is – was perhaps beyond language. There was no reference, either – paint stores, art supply shops, artists, all told me that each person and company had its own list, and most were numerical. I argued that Renaissance painters, for example, would have had a group language that they used to discuss their work, and how they got certain tints. Which argued for colour being ephemeral and by consensus … that no fixed “language” for colour exists. The only “fixed” colours were by analogy, as in “Granny Smith apple green”, useless for those not familiar with Granny Smith apples. By further thought, I came to the conclusion that ALL knowledge or understanding was by analogy. Which argues for the inquisitive mind to a) get as broad a personal knowledge of the universe as possible, and b) to realize that if he hasn’t seen it or done it, he can claim only a shadow of understanding. All because I didn’t know if her lipstick was “pink” or “hot pink”.

    Like

  20. “A friend pointed out that to spell it right, you can think of it as “fuck-sia” (“fuch-sia”).”

    But…your Google spellcheck shows it as “fus-chia”!

    Like

  21. It amazes me how some people cannot understand that this is never meant to be a rigorous study. On the other hand, it just proves how awesome this is.

    Like

  22. Hey… I’m pretty sure that I can spell fuchsia and did it! To be sure I checked dictionary beforehand I clearly remember that.

    Though, after a while all of the colors looked same to me so i started typing names in a pattern which is… color(k-1)ish color(k). So maybe it would be better to give a color then black a little then a new color? I dunno anymore every thing is greenish purple here.

    Like

  23. Just curious. Were there statistically significant differences in the data between all the variants of “poop”? I would imagine at the very least, “poo,” “poop,” and “shit” would all refer to the same color, just with slightly different registers.

    Like

  24. Firtsly colourblind here the survey drove me nuts.

    also after seeing your section on spam i think your survey atrracted /b/tards

    Like

  25. L.O.L’ed enough to get a raised eyebrow from my boss….

    Or more accurately – brilliant.

    Like

  26. “canonymous,” anna: It obviously isn’t rigorous, and it may indeed be half in jest, but it honestly isn’t funny enough or clever enough (for me) to be worthwhile. Why even bother with the trappings of legitimacy? One has to have some understanding of something if one is going to make fun of it.

    Half ingested and fully indigestible. Enough.

    Like

Comments are closed.