Color Survey Results

Who in the rainbow can draw the line where the violet tint ends and the orange tint begins? Distinctly we see the difference of the colors, but where exactly does the one first blendingly enter into the other? So with sanity and insanity.
—Herman Melville, Billy Budd

Orange, red? I don’t know what to believe anymore!
—Anonymous, Color Survey

I WILL EAT YOUR HEART WITH A FUCKING SPOON IF YOU AKS ANY MORE QUESTIONS ABOUT COLORS
—Anonymous, Color Survey

Thank you so much for all the help on the color survey.  Over five million colors were named across 222,500 user sessions.  If you never got around to taking it, it’s too late to contribute any data, but if you want you can see how it worked and take it for fun here.

First, a few basic discoveries:

  • If you ask people to name colors long enough, they go totally crazy.
  • “Puke” and “vomit” are totally real colors.
  • Colorblind people are more likely than non-colorblind people to type “fuck this” (or some variant) and quit in frustration.
  • Indigo was totally just added to the rainbow so it would have 7 colors and make that “ROY G. BIV” acronym work, just like you always suspected. It should really be ROY GBP, with maybe a C or T thrown in there between G and B depending on how the spectrum was converted to RGB.
  • A couple dozen people embedded SQL ‘drop table’ statements in the color names. Nice try, kids.
  • Nobody can spell “fuchsia”.

Overall, the results were really cool and a lot of fun to analyze.  There are some basic limitations of this survey, which are discussed toward the bottom of this post.  But the sheer amount of data here is cool.

Sex

By a strange coincidence, the same night I first made the color survey public, the webcomic Doghouse Diaries put up this comic (which I altered slightly to fit in this blog, click for original):

It was funny, but I realized I could test whether it was accurate (as far as chromosomal sex goes, anyway, which we asked about because it’s tied to colorblindness) [Note: For more on this distinction, see my follow-up post]. After the survey closed, I generated a version of the Doghouse Diaries comic with actual data, using the most frequent color name for the handful of colors in the survey closest to the ones in the comic:

Basically, women were slightly more liberal with the modifiers, but otherwise they generally agreed (and some of the differences may be sampling noise).  The results were similar across the survey—men and women tended on average to call colors the same names.

So I was feeling pretty good about equality.  Then I decided to calculate the ‘most masculine’ and ‘most feminine’ colors.  I was looking for the color names most disproportionately popular among each group; that is, the names that the most women came up with compared to the fewest men (or vice versa).

Here are the color names most disproportionately popular among women:

  1. Dusty Teal
  2. Blush Pink
  3. Dusty Lavender
  4. Butter Yellow
  5. Dusky Rose

Okay, pretty flowery, certainly.  Kind of an incense-bomb-set-off-in-a-Bed-Bath-&-Beyond vibe.  Well, let’s take a look at the other list.

Here are the color names most disproportionately popular among men:

  1. Penis
  2. Gay
  3. WTF
  4. Dunno
  5. Baige

I … that’s not my typo in #5—the only actual color in the list really is a misspelling of “beige”.  And keep in mind, this is based on the number of unique people who answered the color, not the number of times they typed it.  This isn’t just the effect of a couple spammers. In fact, this is after the spamfilter.

I weep for my gender.  But, on to:

RGB Values

Here are RGB values for the first 48 out of about a thousand colors whose RGB values (across the average monitor, shown on a white background) I was able to pin down with a fairly high degree of precision:

The full table of 954 colors is here, also available as a text file here (I have no opinion about whether it should be used to build a new X11 rgb.txt except that seems like the transition would be a huge headache.)

The RGB value for a name is based on the location in the RGB color space where there was the highest frequency of responses choosing that name.  This was tricky to calculate.  I tried simple geometric means (conceptually flawed), a brute force survey of all potential center points (too slow), and fitting kernel density functions (math is hard). In the end, I used the average of a bunch of runs of a stochastic hillclimbing algorithm.  For mostly boring notes on my data handling for this list, see the comments at the bottom of the xkcd.com/color/rgb/ page.

Spelling and Spam

Spelling was an issue for a lot of users:

Now, you may notice that the correct spelling is missing.  This is because I can’t spell it either, and when running the analysis, used Google’s suggestion feature as a spellchecker:

A friend pointed out that to spell it right, you can think of it as “fuck-sia” (“fuch-sia”).

Misspellings aside, a lot of people spammed the database, but there were some decent filters in place.  I dropped out people who gave too many answers which weren’t colors used by many other people.  I also looked at the variation in hue; if people gave the same answer repeatedly for colors of wildly varying hue, I threw out all their results.  This mainly caught people who typed the same thing over and over.  Some were obviously using scripts; based on the filter’s certainty, the #1 spammer in the database was someone who named 2,400 colors—all with the same racial slur.

Map

Here’s a map of color boundaries for a particular part of the RGB cube.  The data here comes from a portion of the survey (1.5 million results) which sampled only this region and showed the colors against both black and white backgrounds.

The data for this chart is here (3.6 MB text file with each RGB triplet named).  Despite some requests, I’m not planning to make a poster of any of this, since it seems wrong to take advantage of all this volunteer effort for a profit; I just wanted to see what the results looked like.  You’re welcome to print one up yourself (huge copy here), but keep in mind that print color spaces are different from monitor ones.

Basic Issues

Of course, there are basic issues with this color survey.  People are primed by the colors they saw previously, which adds overall noise and some biases to the data (although it all seemed to even out in the end).  Moreover, monitors vary; RGB is not an absolute color space.  Fortunately, what I’m really interested in is what colors will look like on a typical monitors, so most of this data is across the sample of all non-colorblind users on all types of monitors (>90% LCD, roughly 6% CRT).

Color is a really fascinating topic, especially since we’re taught so many different and often contradictory ideas about rainbows, different primary colors, and frequencies of light. If you want to understand it better, you might try the neat introduction in Chapter 35 ofThe Feynman Lectures on Physics (Vol. 1), read Charles Poynton’s Color FAQ, or just peruse links from the Wikipedia article on color.  For the purposes of this survey, we’re working inside the RGB space of the average monitor, so this data is useful for picking and naming screen colors. And really, if you’re reading this blog, odds are you probably—like me—spend more time looking at a monitor than at the outdoors anyway.

Miscellaneous

Lastly, here are some assorted things people came up with while labeling colors:

Thank you so much to relsqui for writing the survey frontend, and to everyone else who sacrificed their eyeballs for this project.  If you have ideas and want to analyze these results further, I’ve posted the raw data as an SQLite dump here (84 MB .tar.gz file). It’s been anonymized, with IPs, URLs, and emails removed.  I also have GeoIP information; if you’d like to do geocorrelation of some kind, I’ll be providing a version of the data with basic region-level lat/long information (limited to protect privacy) sometime in the next few days. Note: The ColorDB data is the main survey.  The SatOnly data is the supplementary survey covering only the RGB faces in the map, and was presented on a half-black half-white background.)

And, of course, if you do anything fun with this data, I’d love to see the results—let me know at xkcd@xkcd.com.

1,287 replies on “Color Survey Results”

  1. I wish I knew about this website a lonnnng time ago. I absolutely love the intense geekery, and the thorough-ness with which you ask the question, What color is this?

    I tried clicking on the color survey, but it didn’t work, and I suppose that’s because it’s December, and you wrote this in May.

    I vote that you do some stuff about chemistry things so I can use it in class. I’m trying to peruse through your stuff – I’ve got some physics comics for when we’re in circuits. These will be awesome! I would also like to somehow make a poster of your misusing slang comic. This is a running joke in my classes right now, and would be absolutely perfect. Can I purchase one or something? It would be awesome. Please let me know!

    Like

  2. I’d love to play with this survey – any chance you still have a working link? When I attempt to access it, it comes up 404 Found Error. 😦

    I mean, it’s winter break, what’s a hard-working English teacher supposed to do with her time off but play online? =)

    Like

  3. Wow, didn’t know it’s so hard for English native speakers to spell “fuchsia” correctly.
    BUT I DO understand after I heard how it’s pronounced in English (similar to fusion). My god, what is this?
    It’s a lot easier in German – it’s pronounced similar to “fuck-sia” (only with an u like in bush). English orthography IS horrible indeed…

    Back to the colors: I’m a synesthete (mostly with numbers/letters/words and colors) so I’m very sensitive if it comes to colors. My boyfriend is always teasing me with something like this: “So, this word is blue? Like sky blue?” And I’m like “Oh my god look at it, LOOK AT IT, it’s clearly midnight blue with a soft luster of graphite, can’t you see it?! You’re not even trying!!” Haha always mad at this.

    AND I HATE zyklam. I don’t know how it’s called in English (can’t find it), but this is the color:

    My god, I can’t even look at it. It’s unbelievable disgusting. I mean, look at it, it should be full of passion or intense or strong and it’s so… lame and nearly fading. Eeeeew.

    Like

  4. I’m not a synesthete, but that’s Pepto (-Bismol) Pink, so no wonder you don’t like it.

    Like

  5. Oh. My. Gawd.
    My eyes are smoking, and it ain’t the good shit.
    I worked in the printing industry for years, and this article brings back memories best recalled with hot sauce injected in my urethra (for comic relief).
    Keep up the good work!

    Like

  6. Just writing to say that I cried tears while laughing at that final list of colors. And now my jaws hurt.

    Thanks 🙂

    Like

  7. Nice post, laughed a lot, though pity the surver isn’t online anymore (according to the replies above).

    Like

  8. Fascinating. A lot of comments that people felt overwhelmed by having to make so many choices. Obviously some people felt motivated to find unique signifiers,while other were content with broad classifications. I am curious how the results would differ if the survey was done speed round style (first,quick response)versus a deliberative, unlimited time style.

    Like

  9. Wow! i did not know that men and women personality (“cultural difference”) would show that much!

    Men seems to be less interested eventough studies has shown that men are more keen to visual displays as women are more reacting on audio.

    Like

  10. This dress is perfect Couture Wedding Dresses Plus size Wedding Dresses fantastic dress over all Maternity Wedding Dresses The dress is simple beautiful Destination Wedding Dresses Blue Wedding Dresses Vintage Wedding Dresses I fellcompletely in love with this dress when i saw it Unique Wedding Dresses Romantic Wedding Dresses Cocktail Dresses very hard for me to find something that fits good and it is perfect Evening Dresses Recieved this dress much sooner than expected and it is beautiful Celebrity Dresses Special Occasion Dresses Blue Wedding Gowns This dress is perfect Prom Dresses Sweet 16 Dresses Quinceanera Dresses.

    Like

  11. I see a problem with different languages though.. like I know all the colour names in english…? nope.

    ps: the captchas suck, I had to look at at least 20 to find one readable.

    Like

  12. I don’t know if I got this right but did you or Doghouse Diaries come up with the idea that men have harder to tell the difference between two similar colors than women?
    It would seem to me that this is more of a culture issue. Women have a completely different culture and from men and they have a ‘slightly’ different vocabulary than men. So if women normally learn roughly 50 colors and men learn about 20 (in a lifetime perhaps) then of course most men are going to generalize a lot when confronted with a color naming test. That has little to do with color blindness, even if we scientifically know that women have more color receptors than men.
    What I mean with culture is, for example, that women are very interested in how their clothing and their make-up look. And that is true for any human culture, women are more concerned about this, and thus they have a larger vocabulary for it.
    If you compare two languages from any two different cultures you often see that they don’t have the same number of words for similar things. Some languages have very few words for the English word “death”, and some have a lot of different words with similar meanings. And the same is true for men and women I would say.

    Like

  13. If you compare two languages from any two different cultures you often see that they don’t have the same number of words for similar things. Some languages have very few words for the English word “death”, and some have a lot of different words with similar meanings. And the same is true for men and women I would sayss

    Like

  14. In Italian, where the letters “ch” always make a hard c sound (like “chiaroscuro” or “Machiavelli”), the word “fuchsia” is pronounced “fook-sya.” Apparently the word itself comes from a German botanist named Fuchs, which I believe is pronounced similarly with a hard c. When it came into English, the spelling was retained, but it was pronounced according to the English pronunciation. So that may be a good way to remember it.

    Like

  15. Wow, I never realized there was such a depth to the gender color thing. Very interesting though, so glad i stumbled upon your blog. Can’t wait to show this to my girlfriend I know she will enjoy it as well

    Like

  16. The “names most disproportionately popular among men” had me cracking up. Then I would calm down, re-read it, and start laughing again.

    Like

Comments are closed.