« PREVIOUS ENTRY
Virtual Taps
NEXT ENTRY »
DOESN’T ANYONE SPEAK ENGLISH AROUND HERE??
I’ve written before about Zipf’s Law — a concept invented in the early 20th century by the social scientist George Zipf. Zipf counted word occurence in hundreds of newspaper and magazine articles and found that while English has about 26,000 common words, over 90% of everything we say or write uses merely 2,000 of them. So if you plotted the most-common to least-common words on a graph, you’d see the first few spiking way up high, then quickly dropping down to an almost flat line as you get past the 2,000 most common words. A Zipf Curve looks like a ski slope.
Later on, the economist and sociologist Herbert Simon offered an explanation for this. He pointed out that words gain meaning the more they’re used — which gives the first few words in a text “first mover” advantage, since they’re helping to define the topic under discussion. For example, I’m more likely to re-use words “occurence” or “meaning” in the rest of this article, while probably never using the word “lawnmower.” That’s part of how a text builds meaning: By introducing a few key words and repeating them, over and over again. (On a broader scale, that’s probably how language evolved too, and possibly why the Zipf Curve exists.)
Anyway, a physicist recently decided to see if music behaves the same way. Damian Zanette of the Balseiro Institute studied the occurrence of notes in several pieces of music. Presto: They, too, had Zipf-Curve distributions. What’s really interesting is when he compared the distributions in “pleasant” music — like Mozart — versus atonal, “difficult” music, like Schoenberg. As Nature reports:
The pieces by Bach, Mozart and Debussy all produced a relatively steep graph, suggesting a strong relationship between rank and frequency, and therefore a high level of meaningful context. In other words, if you have heard part of the piece, it is relatively easy to predict what kind of thing will come next. Zanette adds that jazz pieces he tested showed a similar pattern.
But the Schoenberg piece, one of the first truly atonal works, had a much flatter graph. This means that the piece does not have a set vocabulary of commonly used words that keep appearing. Instead, the size of the vocabulary increases at about the same rate as the length of the piece; new “words” are constantly introduced, while earlier ones are seldom repeated.
I'm Clive Thompson, a writer on science, technology, and culture. This blog collects bits of offbeat research I'm running into, and musings thereon.
Currently, I'm a contributing writer for the New York Times Magazine and a columnist for Wired magazine. I also write for Fast Company and Wired magazine's web site, among other places. Email or AOL IM me (pomeranian99) to say hi or send in something strange!
The “Milky Way Transit Authority” map
Should automobile software be open-sourced?
My Bookforum review of Jaron Lanier’s “You Are Not A Gadget”
Molecular secrets of the “iron-plated snail”
» visit the Collision Detection archives
January 31, 2010 » 07:29 PM
V. A. To me death seems to be an evil.
M. What, to those who are already dead? or to those who must die?
A. To both.
M. It is a misery, then, because an evil?
A. Certainly.
M. Then those who have already died, and those who have still got to die, are both miserable?
A. So it appears to me.
M. Then all are miserable?
A. Every one.
January 24, 2010 » 03:22 PM
One of the more interesting trends is family, which came in at number five. Specifically, discussion about family, moms, dads, daughters, etc. jumped during 2009. With Facebook users getting older, this isn’t a big surprise. However, the fact that the mention of “kids” jumped by a factor of five this year is rather dramatic. It’s tough to know what this means, though. (via Facebook Unveils Most-Mentioned Topics of 2009
)
January 15, 2010 » 01:36 PM
BEYOND AWESOME. They are announcing a recall of the Plush Uterus “due to a potential choking hazard for children”. To apply for it, “Please send an email to the address below with the subject line, ‘UTERUS OPT OUT’”.
January 14, 2010 » 10:04 PM
“To order, please TYPE “YES” IN CHECKBOX BELOW TO AGREE YOU UNDERSTAND THIS PLUSH MUST BE KEPT AWAY FROM KIDS (it is a sex organ, after all). If it is not checked, WE WILL NOT SEND THE UTERUS.” (via @ibogost)
January 11, 2010 » 01:45 PM
I watched Space: 1999 back in the day, but I swear to god I do not remember this scene.
» see all of my photos on Flickr
ECHO
Erik Weissengruber
Vespaboy
Terri Senft
Tom Igoe
El Rey Del Art
Morgan Noel
Maura Johnston
Cori Eckert
Heather Gold
Andrew Hearst
Chris Allbritton
Bret Dawson
Michele Tepper
Sharyn November
Gail Jaitin
Barnaby Marshall
Frankly, I'd Rather Not
The Shifted Librarian
Ryan Bigge
Nick Denton
Howard Sherman's Nuggets
Serial Deviant
Ellen McDermott
Jeff Liu
Marc Kelsey
Chris Shieh
Iron Monkey
Diversions
Rob Toole
Donut Rock City
Ross Judson
Idle Words
J-Walk Blog
The Antic Muse
Tribblescape
Little Things
Jeff Heer
Abstract Dynamics
Snark Market
Plastic Bag
Sensory Impact
Incoming Signals
MemeFirst
MemoryCard
Majikthise
Ludonauts
Boing Boing
Slashdot
Atrios
Smart Mobs
Plastic
Ludology.org
The Feature
Gizmodo
game girl
Mindjack
Techdirt Wireless News
Corante Gaming blog
Corante Social Software blog
ECHO
SciTech Daily
Arts and Letters Daily
Textually.org
BlogPulse
Robots.net
Alan Reiter's Wireless Data Weblog
Brad DeLong
Viral Marketing Blog
Gameblogs
Slashdot Games