<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Name Inspector &#187; Orthography</title>
	<atom:link href="http://www.thenameinspector.com/category/orthography/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.thenameinspector.com</link>
	<description>Tells you what makes names tick.</description>
	<lastBuildDate>Wed, 28 Jul 2010 23:34:00 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Why the &#8220;millionth word&#8221; story is silly</title>
		<link>http://www.thenameinspector.com/why-the-millionth-word-story-is-silly/</link>
		<comments>http://www.thenameinspector.com/why-the-millionth-word-story-is-silly/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 20:08:28 +0000</pubDate>
		<dc:creator>The Name Inspector</dc:creator>
				<category><![CDATA[Bad Linguistics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Orthography]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Words]]></category>
		<category><![CDATA[idioms]]></category>
		<category><![CDATA[snowclones]]></category>

		<guid isPermaLink="false">http://www.thenameinspector.com/?p=399</guid>
		<description><![CDATA[On Wednesday the Global Language Monitor &#8220;announced&#8221; that English got its one-millionth word at precisely 10:22 am GMT that day. And the word was Web 2.0, so naturally, blogs such as  Mashable, John Battelle&#8217;s Searchblog,  and TechCrunch took notice.
Now, The Name Inspector realizes that the &#8220;millionth word&#8221; story is a ridiculous play for attention that&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>On Wednesday the<a href="http://www.languagemonitor.com/"> Global Language Monitor</a> &#8220;announced&#8221; that English got its one-millionth word at precisely 10:22 am GMT that day. And the word was <em>Web 2.0</em>, so naturally, blogs such as  <a href="http://mashable.com/2009/06/10/web20-millionth-word/">Mashable</a>, <a href="http://battellemedia.com/archives/004939.php">John Battelle&#8217;s Searchblog</a>,  and <a href="http://www.techcrunch.com/2009/06/10/now-that-its-the-one-millionth-word-web-20-can-be-retired-to-the-dictionary/">TechCrunch</a> took notice.</p>
<p>Now, The Name Inspector realizes that the &#8220;millionth word&#8221; story is a ridiculous play for attention that&#8217;s not to be taken seriously, and that the folks at the Global Language Monitor know it. But the story has gotten people talking about what a word is, and that&#8217;s a topic that The Name Inspector can warm to.</p>
<p>The easiest criticism of the millionth-word story is that <em>Web 2.0</em> isn&#8217;t a word, but a phrase. That&#8217;s the main thing that linguist Geoffrey Pullum had to say about the matter on <a href="http://languagelog.ldc.upenn.edu/nll/?p=1497">Language Log</a>. And that&#8217;s pretty disappointing, actually, because it ignores the fact that the whole enterprise of counting words that precisely is linguistically suspect.</p>
<p>Why would The Name Inspector object to counting words? Believe it or not, it&#8217;s not due to a perverse academic refusal to give simple answers to simple questions. The innocent word, which seems to be the very simplest little bit of language to understand, is remarkably hard to pin down. There are very clear examples of words, like <em>dog</em>, but around the edges the word category is fuzzy. That makes it hard to count words with any precision, let alone announce the exact time of day when a word enters the language.</p>
<p>Let&#8217;s start with the very dumbest definition of word, the one used by the &#8220;word count&#8221; function on your word processor: A word is a string of characters (lets say letters) with no spaces. Well, that would mean the following string consists of five words:<em> jjj akjsdhfjkh auygfh tg drqwds</em>.</p>
<p>We can do better than that: A word is a string of letters with no spaces that has a meaning and can be used in a sentence. By this definition, <em>Web 2.0</em> doesn&#8217;t cut it. And many people who&#8217;ve weighed in on the issue in blog comments have raised just that objection. Some object to the space, some to the digits, some to the punctuation. Sorry, sorry, and sorry. If inclusion in a dictionary is the ultimate proof of wordhood, then consider this: Even the abridged online version of the Merriam-Webster dictionary includes entries for <em>deep six</em>, <em>12-step</em>, <em>20/20</em>, <em>24-7</em>, <em>3 D</em>, and even <em>86ing </em>(a slang term for refusing to serve a customer). All these include numbers, all but one include digits, some have punctuation, and one has a space.</p>
<p>Now about spaces. It&#8217;s commonly accepted that English has complex prepositions that consist of parts. In some cases the parts are separated by spaces, and in others they&#8217;re not. We write <em>in lieu of</em> as three chunks, and <em>instead of</em> as two, even though their structures are parallel, etymologically speaking. Then there&#8217;s <em>notwithstanding</em>.</p>
<p>There are many compound words in the Merriam-Webster dictionary (and others) that are written with spaces. A space is a purely orthographic entity, and it&#8217;s silly to define a linguistic unit based on orthography alone. Spoken language is primary. Written language is, ultimately, a representation of spoken language. There are compounds that some people write as &#8220;one word&#8221; and some people write as &#8220;two words&#8221;, though the pronunciation remains constant. <em>Website</em>/<em>web site</em> is one example. If you use the no-space criterion, you end up saying that such expressions are sometimes words and sometimes not words, based on orthographic variation. And that just doesn&#8217;t make good sense.</p>
<p>Of course, you might appeal to The Language Boss to tell you which version is &#8220;correct&#8221;. But people, it&#8217;s time to wake up and realize that The Language Boss is a fiction, like the Wizard of Oz. There are just different people, sometimes with different opinions, bumbling around behind their curtains. Pay no attention to that language maven behind the curtain!</p>
<p>Lurking behind the orthographic issue, of course, is a deeper linguistic one: If some words are made of pieces that are themselves words, how do we know when a group of words adds up to a complex word as opposed to a phrase or a random stretch of language? Here linguists begin to rely on criteria that distance the definition of <em>word </em>from the pragmatic, what-you-list-in-the-dictionary understanding of what a word is. The linguists might, for example, think about how an expression interacts with the rules of English stress assignment, or about it&#8217;s syntactic behavior. In any event, for a group of words to add up to a complex word, it has to be a conventional, cohesive unit.</p>
<p>And here there are no hard and fast rules. Idioms make things especially complicated. Merriam-Webster lists <em>kick ass</em> and <em>kick the bucket</em> under its entry for <em>kick</em>. So these idioms get a sort of honorary word treatment. But notice that idioms don&#8217;t always occur in exactly the same form: we can <em>kick a little ass</em> or <em>kick some ass</em> or even <em>kick some Raider ass</em>. In idioms, words begin to blend into grammar, and that&#8217;s where things get really tricky.</p>
<p>Some idioms, like <em>kick the bucket </em>and <em>kick ass</em>, are identified mostly by the presence of certain component  words. Others, however, are more like grammatical templates. Consider sentences like <em>There&#8217;s only so far a car can go with a flat tire</em>, <em>There&#8217;s only so long you can sit before you have to get up and walk around</em>, and <a href="http://guides.ign.com/guides/16512/page_4.html">There&#8217;s only so often you can talk or sneak your way out of a fight</a>. There&#8217;s a pattern here that&#8217;s something like <em>There&#8217;s only so</em> <strong>X Y</strong> <em>can </em><strong>Z</strong>, where X is a scalar measure or property of some kind,  Y is a noun phrase, and Z is a verb phrase. Most people wouldn&#8217;t call this pattern a word, but it&#8217;s hard to find the exact barrier between this pattern and something like <em>kick ass</em>. (To see lots of patterns like this, you might take a look at the <a href="http://snowclones.org/">Snowclone Database</a>).</p>
<p>Even when you&#8217;re talking about words with simple forms, it can be hard to decide how to count them. That&#8217;s because words aren&#8217;t just forms&#8211;they also have meanings, and it&#8217;s often the case that the same form has more than one meaning. If the meanings are very different, we usually think of there being more than one word. For example, <em>bank </em>used in connection with a river is one word, and <em>bank </em>used in reference to a financial institution is another.</p>
<p>But what if the meanings are only a little different? How many &#8220;words&#8221; are represented by these different uses of the verb <em>see</em>?</p>
<blockquote><p>Can you see the car?<br />
I see that it&#8217;s raining.<br />
I don&#8217;t see why you&#8217;re so angry.<br />
Let&#8217;s go see grandma.<br />
Are you seeing anyone?<br />
I&#8217;ll see your twenty and raise you ten.<br />
Let me see you to your door.<br />
See to it that this doesn&#8217;t get out.</p></blockquote>
<p>All these complexities don&#8217;t mean it&#8217;s impossible in principle to count the number of words in the English language. They do, however, mean that it&#8217;s very, very hard, and that you have to know what you mean by <em>word </em>before you start.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.thenameinspector.com/why-the-millionth-word-story-is-silly/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Q gotta be kidding</title>
		<link>http://www.thenameinspector.com/q-gotta-be-kidding/</link>
		<comments>http://www.thenameinspector.com/q-gotta-be-kidding/#comments</comments>
		<pubDate>Thu, 27 Sep 2007 21:13:29 +0000</pubDate>
		<dc:creator>The Name Inspector</dc:creator>
				<category><![CDATA[Company Names]]></category>
		<category><![CDATA[Names]]></category>
		<category><![CDATA[Naming Stories]]></category>
		<category><![CDATA[Orthography]]></category>
		<category><![CDATA[Trademark Issues]]></category>
		<category><![CDATA[Words]]></category>

		<guid isPermaLink="false">http://www.thenameinspector.com/q-gotta-be-kidding/</guid>
		<description><![CDATA[Last Friday&#8217;s New York Times had an article about QVC, described as &#8220;the leading home shopping network&#8221;. QVC is one of those boring abbreviation names&#8212;it stands for &#8220;Quality, Value, Convenience&#8221;. Apparently QVC wants to jazz up its image with a rebranding of sorts, and has a new ad campaign prominently featuring the letter Q. The [...]]]></description>
			<content:encoded><![CDATA[<p>Last Friday&#8217;s New York Times had <a href="http://www.thenameinspector.com/1348113600&amp;en=a6303fe4326c2d2a&amp;ei=5124&amp;partner=permalink&amp;exprod=permalink">an article about QVC</a>, described as &#8220;the leading home shopping network&#8221;. QVC is one of those boring abbreviation names&#8212;it stands for &#8220;Quality, Value, Convenience&#8221;. Apparently QVC wants to jazz up its image with a rebranding of sorts, and has a new ad campaign prominently featuring the letter Q. The Times quoted Jeff Charney, QVC&#8217;s chief marketing officer: &#8220;We&#8217;d really like to own the 17th letter of the alphabet&#8221;.</p>
<p>This isn&#8217;t the first time someone has tried to own a letter. <a href="http://www.medialifemagazine.com/news2003/mar03/mar10/2_tues/news3tuesday.html">According to MediaLife Magazine</a>, <a href="http://www.oprah.com/omagazine/omag_landing.jhtml">O, The Oprah Magazine</a>, which goes by the nickname of <strong>O Magazine</strong>, was sued for trademark infringement in 2001 by a German publication called <strong>O Magazine</strong>. Judge John Koeltl, who dismissed the lawsuit, reportedly said that O, The Oprah Magazine, which features stories about women&#8217;s health and well-being, and O Magazine, which features photographs of women in fetish attire, are &#8220;devoted to different aspects of women&#8217;s lives&#8221;. Indeed.</p>
<p>Now the folks at QVC may have had a purely metaphorical, fun, marketing use of the word <em>own </em>in mind. But companies often do get all hot and bothered about trying to own our cultural commons more literally. When The Name Inspector worked at a language technology company in the late 1990s, one of the executives thought we should seek a patent on our code because it would be &#8220;like having a patent on the English language&#8221;. Such hubris, such naivete.</p>
<p>Ah, now The Name Inspector is taking a trip down memory lane, and is reminded of another story from the late 1990s, which <a href="http://itre.cis.upenn.edu/~myl/languagelog/archives/000618.html">Geoffrey Pullum wrote</a> about in 2004. James Pustejovsky, a famous computational lexical semanticist (at least, about as famous as a computational lexical semanticist can be), started a natural language processing company in 1997. First he called it <strong>Lexeme</strong>. This is a perfectly nice real-word name. <em>Lexeme </em>is a technical term in linguistics that means, basically, &#8216;word&#8217;. But a large company threatened to sue for infringement upon their trademark. Any guesses about which trademark it was? <strong>Lexus</strong>. Yep, for the luxury division of Toyota. Somehow they believed they owned the sequence of letters <em>lex</em>, despite the fact that this is an ancient Greek root found in English words like <em>lexicon</em>, <em>lexical</em>, and <em>dyslexia</em>.</p>
<p>As Pullum wrote, &#8220;Aren&#8217;t the owners of the English language ever going to rise up against greedy corporations like Lexus and <a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/000581.html">Microsoft</a> and <a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/000611.html">Star</a><a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/000613.html">bucks</a> who lay claim to whole regions of the phonetosphere&#8230;?&#8221;. The prospect of huge legal fees forced Pustejovsky to change the name of his company, but he found his own way to fight back: he called his company  <strong>LingoMotors</strong>, no doubt intending a dig at Lexus. But The Name Inspector fears that fighting linguistic land grabs with little witticisms is a bit like the idea, lampooned in Woody Allen&#8217;s movie &#8220;Manhattan&#8221;, of countering a Nazi rally with a devastating editorial.</p>
<p><small>Tags: <a href="http://technorati.com/tag/q" rel="tag">q</a>, <a href="http://technorati.com/tag/the+letter+q" rel="tag"> the letter q</a>, <a href="http://technorati.com/tag/qvc" rel="tag"> qvc</a>, <a href="http://technorati.com/tag/abbreviations" rel="tag"> abbreviations</a>, <a href="http://technorati.com/tag/lexeme" rel="tag"> lexeme</a>, <a href="http://technorati.com/tag/the+name+lexeme" rel="tag"> the name lexeme</a>, <a href="http://technorati.com/tag/lingomotors" rel="tag"> lingomotors</a>, <a href="http://technorati.com/tag/lexus" rel="tag"> lexus</a>, <a href="http://technorati.com/tag/o+magazine" rel="tag"> o magazine</a>, <a href="http://technorati.com/tag/oprah+magazine" rel="tag"> oprah magazine</a>, <a href="http://technorati.com/tag/oprah" rel="tag"> oprah</a></small></p>
]]></content:encoded>
			<wfw:commentRss>http://www.thenameinspector.com/q-gotta-be-kidding/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Village of the dots: The name Kijiji</title>
		<link>http://www.thenameinspector.com/kijiji/</link>
		<comments>http://www.thenameinspector.com/kijiji/#comments</comments>
		<pubDate>Wed, 11 Jul 2007 22:57:34 +0000</pubDate>
		<dc:creator>The Name Inspector</dc:creator>
				<category><![CDATA[Company Names]]></category>
		<category><![CDATA[Foreign Word Names]]></category>
		<category><![CDATA[Names]]></category>
		<category><![CDATA[Orthography]]></category>
		<category><![CDATA[Real Word Names]]></category>

		<guid isPermaLink="false">http://www.thenameinspector.com/kijiji/</guid>
		<description><![CDATA[
We&#8217;re really getting a chance to enjoy the graphic possibilities of the letters i and j lately. If you&#8217;re a blogger you can put a Lijit Wijit on your blog. And now there&#8217;s Kijiji. Actually, there has been Kijiji for a while now in other countries, but eBay has just launched a U.S. version of [...]]]></description>
			<content:encoded><![CDATA[<p><img id="image120" alt="kijiji-phonetic1.jpg" src="http://www.thenameinspector.com/wp-content/uploads/phonetic-reps/kijiji-phonetic.jpg" /></p>
<p>We&#8217;re really getting a chance to enjoy the graphic possibilities of the letters <strong>i</strong> and <strong>j</strong> lately. If you&#8217;re a blogger you can put a <strong><a href="http://www.lijit.com">Lijit</a> <a href="http://www.widgetbox.com/widget/lijit">Wijit</a> </strong>on your blog. And now there&#8217;s <a href="http://www.kijiji.com/"><strong>Kijiji</strong></a>. Actually, there has been Kijiji for a while now in other countries, but eBay has just launched a U.S. version of this local classified ad network, putting them in direct competition with Craigslist.</p>
<p>Just look at all those dots and vertical lines. When the name is written in a serif font, as it is in the company logo, the dots look like heads, the vertical lines look like torsos, and the serifs look like arms reaching out in a welcoming embrace or unbridled enthusiasm or something else great like that. You would be forgiven for thinking that this unusual name was invented just to achieve this graphical effect. But actually <em>kijiji </em>is a Swahili word meaning &#8216;village&#8217;. It&#8217;s a diminutive form of the word <em>mji</em>, which means &#8216;town&#8217;. So it&#8217;s almost as if in Swahili they call a village a <em>townlet</em>.</p>
<p>Why choose Swahili for the name of an international network of classified ad sites? There are two good reasons. First, Swahili is a <em>lingua franca</em>&#8211;a common language used for business by speakers of other languages&#8211;in East Africa. Native speakers live in and around Tanzania.</p>
<p>Second, Swahili syllables tend to conform to what phonologists consider the universally preferred syllable structure, which is a single consonant followed by a single vowel, represented CV. (An exception is the first syllable of the word <em>mji</em>, which  consists entirely of the nasal [m] sound. That may seem exotic, but in English we actually have syllables consisting only of nasal sounds, like the final syllable of the word <em>button</em>. We just don&#8217;t put those syllables at the beginnings of words.)</p>
<p>What does it mean for the CV syllable to be universally preferred? For one thing, it means kids produce this kind of syllable first when acquiring language. It also means that this kind of syllable is found throughout the languages of the world, while other kinds of syllable are more likely not to be allowed in this or that language. Most importantly, it means that in theory just about everybody in the world should find it pretty easy to pronounce this name.</p>
<p>About the first part of <strong>Kijiji</strong>: Have you studied French or Spanish or German or some other language and struggled with the system of grammatical gender? Well, if you didn&#8217;t enjoy that, steer clear of Swahili. It has more than ten genders, or <em>noun classes</em>, as they&#8217;re commonly called.  Each noun starts with a prefix like <em>ki-</em> or <em>m-</em> showing its class. When a noun is used as a subject, verbs and some other words must be marked for agreement with its class prefix. At least the noun classes are based more on meaning than the random gender systems of other languages are, so it&#8217;s often easy to guess which class a word belongs to.</p>
<p>Anyway, The Name Inspector is getting off topic. How does this name do in English? Well, it&#8217;s actually a little hard to pronounce, preferred syllable structure notwithstanding. The two affricate sounds followed by high front vowels are kind of awkward to squeeze out. And despite the graphical gimmick of the name (or perhaps because of it), the orthography is a little hard to parse. When you see this name on the page or the screen, it looks like a bunch of scratches&#8211;it&#8217;s hard to distinguish the letters.</p>
<p>So this name makes perfect sense from a semantic and sociolinguistic point of view, but it suffers from the very orthographic and phonetic properties that make it special.</p>
<p><small>Tags: <a href="http://technorati.com/tag/kijiji" rel="tag">kijiji</a>, <a href="http://technorati.com/tag/the+name+kijiji" rel="tag"> the name kijiji</a>, <a href="http://technorati.com/tag/swahili" rel="tag"> swahili</a>, <a href="http://technorati.com/tag/lijit" rel="tag"> lijit</a>, <a href="http://technorati.com/tag/wijit" rel="tag"> wijit</a>, <a href="http://technorati.com/tag/lingua+franca" rel="tag"> lingua franca</a>, <a href="http://technorati.com/tag/syllables" rel="tag"> syllables</a></small></p>
]]></content:encoded>
			<wfw:commentRss>http://www.thenameinspector.com/kijiji/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
