Avatar

so long as it's words

@solongasitswords / solongasitswords.tumblr.com

Shouty, sweary, enthusiastic Linguistics Everything wonderful, fascinating and bizarre about language OUR BLOG
FOLLOW US ON TWITTER

Map shows UK’s weirdest place names.

The fuck is Westward Ho! ?

some things:

both crapstone and blotusfleming are near where i grew up curry mallet is not far from where i went to school (there is also shepton mallet, helpfully known to my classmates as “shit’n’smell-it”, both near glastonbury) nether wallop is on the way through to my aunt’s old house and i used to force my family to drive through there (also upper and middle wallop) and i once met the girl whose family owned pretty much that entire area and her name was clementine wallop my grandmother grew up in wales and llanfairpwllgwyngyllogogerychwyrnrobwilllantysiliogogogoch is her party piece from where pity me is located i already do scotland’s lack of fucks can be seen from space

horrid hill is in, *squints*, kent? i bet it is

PENISTONE. 

Curry Mallet’s barely even weird, although maybe I’m just used to it. Nempnett Thrubwell is much weirder. It made it into The Meaning of Liff as ‘The feeling experienced when driving off for the first time on a brand new motorbike.‘

there’s also a Pett Bottom in Kent.

May I add for consideration Keith Briggs’ map of place-names containing cunt? Yes? OK.

settle this for me once and for all

is “chai” a TYPE of tea??! bc in Hindi/Urdu, the word chai just means tea

its like spicy cinnamon tea instead of bland gross black tea

I think the chai that me and all other Muslims that I know drink is just black tea

i mean i always thought chai was just another word for tea?? in russian chai is tea

why don’t white people just say tea

do they mean it’s that spicy cinnamon tea

why don’t they just call it “spicy cinnamon tea”

the spicy cinnamon one is actually masala chai specifically so like

there’s literally no reason to just say chai or chai 

They don’t know better. To them “chai tea” IS that specific kind of like, creamy cinnamony tea. They think “chai” is an adjective describing “tea”.

What English sometimes does when it encounters words in other languages that it already has a word for is to use that word to refer to a specific type of that thing. It’s like distinguishing between what English speakers consider the prototype of the word in English from what we consider non-prototypical.

(Sidenote: prototype theory means that people think of the most prototypical instances of a thing before they think of weirder types. For example: list four kinds of birds to yourself right now. You probably started with local songbirds, which for me is robins, blue birds, cardinals, starlings. If I had you list three more, you might say pigeons or eagles or falcons. It would probably take you a while to get to penguins and emus and ducks, even though those are all birds too. A duck or a penguin, however, is not a prototypical bird.)

“Chai” means tea in Hindi-Urdu, but “chai tea” in English means “tea prepared like masala chai” because it’s useful to have a word to distinguish “the kind of tea we make here” from “the kind of tea they make somewhere else”.

“Naan” may mean bread, but “naan bread” means specifically “bread prepared like this” because it’s useful to have a word to distinguish between “bread made how we make it” and “bread how other people make it”.

We also sometimes say “liege lord” when talking about feudal homage, even though “liege” is just “lord” in French, or “flower blossom” to describe the part of the flower that opens, even though when “flower” was borrowed from French it meant the same thing as blossom. 

We also do this with place names: “brea” means tar in Spanish, but when we came across a place where Spanish-speakers were like “there’s tar here”, we took that and said “Okay, here’s the La Brea tar pits”.

 Or “Sahara”. Sahara already meant “giant desert,” but we call it the Sahara desert to distinguish it from other giant deserts, like the Gobi desert (Gobi also means desert btw).

Languages tend to use a lot of repetition to make sure that things are clear. English says “John walks”, and the -s on walks means “one person is doing this” even though we know “John” is one person. Spanish puts tense markers on every instance of a verb in a sentence, even when it’s abundantly clear that they all have the same tense (”ayer [yo] caminé por el parque y jugué tenis” even though “ayer” means yesterday and “yo” means I and the -é means “I in the past”). English apparently also likes to use semantic repetition, so that people know that “chai” is a type of tea and “naan” is a type of bread and “Sahara” is a desert. (I could also totally see someone labeling something, for instance, pan dulce sweetbread, even though “pan dulce” means “sweet bread”.)

Also, specifically with the chai/tea thing, many languages either use the Malay root and end up with a word that sounds like “tea” (like té in Spanish), or they use the Mandarin root and end up with a word that sounds like “chai” (like cha in Portuguese).

So, can we all stop making fun of this now?

Okay and I’m totally going to jump in here about tea because it’s cool. Ever wonder why some languages call tea “chai” or “cha” and others call it “tea” or “the”? 

It literally all depends on which parts of China (or, more specifically, what Chinese) those cultures got their tea from, and who in turn they sold their tea to. 

The Portuguese imported tea from the Southern provinces through Macau, so they called tea “cha” because in Cantonese it’s “cha”. The Dutch got tea from Fujian, where Min Chinese was more heavily spoken so it’s “thee” coming from “te”. And because the Dutch sold tea to so much of Europe, that proliferated the “te” pronunciation to France (”the”), English (”tea”) etc, even though the vast majority of Chinese people speak dialects that pronounce it “cha” (by which I mean Mandarin and Cantonese which accounts for a lot of the people who speak Chinese even though they aren’t the only dialects).

And “chai”/”chay” comes from the Persian pronunciation who got it from the Northern Chinese who then brought it all over Central Asia and became chai.

(Source

This is the post that would make Uncle Iroh join tumblr

Tea and linguistics. My two faves.

I love this

I co-wrote a post for The Toast! About Old English and historical accuracy and how we choose to represent the past.

And this is one of the dilemmas of creating historical fiction; what do you sacrifice because it would be misunderstood by modern audiences? What do you alter because it’s important to the story? Are these little extras like bonus Easter Eggs, or are they just a shortcut to history?
”…there’s nothing about a James, a Hannah or a Kate that makes them more or less like any other James, Hannah or Kate. Each James is unique, as is each Hannah and each Kate.
So names are just labels we use to refer to specific people and can’t be used in any general sense. At least that’s the theory. But then there’s When Harry Met Sally.”
We have a new blog post! By our excellent friend James, writing about the meaning behind personal names, invoking Brooklyn Nine Nine, When Harry Met Sally and Flight of the Concords, as well as some top-notch language info. Give it a read!

Why’s it called a blowjob when you suck not blow?

It was originally called a belowjob, standing for ‘below the belt’ But because people are shit at English, the two words sound the same, so over time it became known as Blow and not Below.

Damn

FACT OF THE DAY FOR MY FUCKING FRIENDS

Avatar

Thank you this has bothered me.

literally no truth in this and now my OED account shows that i looked up ‘blowjob’. thanks tumblr.

the verb ‘to blow’ was used that way in the 1930s, and ‘blowjob’ was created from that although the exact reasons for ‘blow’ are uncertain. There’s a good discussion here WHY do i always end up writing about swearwords.

Language learners’ levels of weirdness.

ICELANDIC-BASQUE PIDGIN??!

Level 8: any conlang not associated with a media franchise and not meant for international communication: ascended to another plane of existence.

And yeah, the Icelandic-Basque pidgin really existed. But it’s not even the weirdest Basque-based pidgin we have evidence of. Let me give you: the Algonquian-Basque pidgin!

Avatar

What even are these examples of Icelandic-Basque pigeon though??

My family’s native language, which I grew up speaking, is far from a niche language. Bengali is the seventh most common native language in the world, sitting ahead of the eighth (Russian) by a wide margin, with as many native speakers as French, German, and Italian combined. And yet, on the Internet, Bengali is very much a second-class citizen – as are Arabic (#5), Hindi (#4), and Mandarin (#1) – any language which is not written with the Latin alphabet. The very first version of the Unicode standard did include Bengali. However, it left out a number of important characters. Until 2005, Unicode did not have one of the characters in the Bengali word for “suddenly”. Instead, people who wanted to write this everyday word had to combine three separate, unrelated characters. For English-speaking teenagers, combining characters in unexpected ways, like writing ‘w’ as ‘\/\/’, used to be a way of asserting technical literacy through “l33tspeak” – a shibboleth for nerds that derives its name from the word “elite”. But Bengalis were forced to make similar orthographic contortions just to write a simple email: ত + ্ + ‍ = ‍ৎ (the third character is the invisible “zero width joiner”). Even today, I am forced to do this when writing my own name. My name is not only a common Indian name, but one of the top 1,000 names in the United States as well. But the final letter has still not been given its own Unicode character, so I have to use a substitute… I am not the only one who has trouble writing their name correctly in Unicode. Linguistically, East Asian languages such as Chinese, Japanese, and Korean have distinct writing systems. Some (but not all) of the characters trace their lineage back to a common set, but even these characters, known as Han characters, began to diverge and evolve independently over two thousand years ago. The Unicode Consortium has launched a very controversial project known as Han Unification: an attempt to create a limited set of characters that will be shared by these so-called “CJK languages.” Instead of recognizing these languages as having their own writing systems that share some common ancestry, the Han unification process views them as mere variations on some “true” form. To help English readers understand the absurdity of this premise, consider that the Latin alphabet (used by English) and the Cyrillic alphabet (used by Russian) are both derived from Greek. No native English speaker would ever think to try “Greco Unification” and consolidate the English, Russian, German, Swedish, Greek, and other European languages’ alphabets into a single alphabet. Even though many of the letters look similar to Latin characters used in English, nobody would try to use them interchangeably. ҭЋаt ωoulδ βε σutragєѳuѕ. Even though our language is exempt from this effort, Han unification is particularly troubling for Bengali speakers to hear about. The rhetoric is a blast from our own colonial past, when the British referred to Indian languages pejoratively as “dialects”. Depriving their colonial subjects of distinct linguistic identities was a key tactic in justifying their brutal rule over an “uncivilized” people.

One of my favorite linguistic phenomena is rebracketing, which is when a word or words is/are redivided differently, either two words becoming one, one word heard as two, or part of one word interpreted as part of the other.  This frequently happens with articles, for example:

  • apron was originally napron, but “a napron” was interpreted as “an apron”
  • newt comes from ewt by the same process
  • nickname comes from Middle English nekename which in turn came from ekename (an ekename -> a nekename) where “eke” was an old word meaning “also” or “additional” (so basically “an additional name”)
  • ammunition comes from French amunition, which came from munition, the phrase la munition being heard as l’amunition.
  • the nickname Ned comes from Ed, via “mine Ed” being heard as “my Ned” (in archaic English, “my” and “mine” had the same relationship as “a” and “an”), same with several other nicknames like Nell
  • The word “orange” ulimately derives from the Arabic nāranj, via French “orange”, the n being lost via a similar process involving the indefinite article, e.g., something like French “une norange” becoming “une orange” (it’s unclear which specific Romance language it first happened in)
  • in the Southern US at least (not sure about elsewhere), “another” is often analyzed as “a nother”, hence the phrase “a whole nother”
  • omelet has a whole series of interesting changes; it comes from French omelette, earlier alemette (swapping around the /l/ and /m/), from alemelle from an earlier lemelle (la lemelle -> l’alemelle)

Related to this, sometimes two words, especially when borrowed into another language, will be taken as one.  Numerous words were borrowed from Arabic with the definite article al- attached to them.  Spanish el lagarto became English alligator.  An interesting twist is admiral, earlier amiral (the d probably got in there from the influence of words like “administer”) from Arabic amir al- (lord of the ___), particularly the phrase amir al-bahr, literally “lord of the sea”.

Sometimes the opposite happens.  A foreign word will look like two words, or like a word with an affix.  For example, the Arabic kitaab was borrowed into Swahili as kitabu.  ki- happens to be the singular form of one of the Swahili genders, and so it was interpreted as ki-tabu.  To form the plural of that gender, you replace ki- with vi-, thus, “books” in Swahili is vitabu.  The Greek name Alexander became, in Arabic, Iskander, with the initial al- heard as the article al-.

Similarly, the English word Cherry came from Old Norman French cherise, with the s on the end interpreted as the plural -s.  Interestingly enough, that word came from Vulgar Latin ceresia, a feminine singular noun, but originally the plural of the neuter noun ceresium!  So a Latin plural was reinterpreted as a singular in Vulgar Latin, which in turn was interpreted as a plural when borrowed into English!

The English suffix -burger used with various foods (e.g., cheeseburger, or more informally chickenburger, etc.) was misanlyzed from Hamburger as Ham-burger, itself from the city of Hamburg

This can happen even with native words.  Modern French once is used for the snow leopard, but originally meant “lynx”.  In Old French, it was lonce (ultimately from the same source as lynx), which was reinterpreted as l’once!  In English, the word “pea” was originally “pease”, but that looked like it had the plural -s on it, and so the word “pea” was created from it.  Likewise, the adjective lone came from alone, heard as “a lone”, but alone itself came originally from all one.

Avatar

when i was 5 i remember my older sister called me “annuisance” a lot to her friends.

Avatar

Reader Chris passes along an article about differences in American Sign Language usage between white and African-American signers. Researchers investigating what they call Black ASL found significant variations in signs, signing space, and facial cues. They explain:

Black ASL is not just a slang form of signing. Instead, think of the two signing systems as comparable to American and British English: similar but with differences that follow regular patterns and a lot of variation in individual usage.

They hypothesize that these differences began in segregated learning environments, and continue to evolve in Black social spaces. The whole article is worth a read.

Thanks, Chris, and remember — you can submit Wonk-worthy links through our ask or via email!

ETA, 9/24/12: Many of you have brought up the use of the word “mainstream” in this infographic. Better choices definitely exist, since this word rings of othering. We appreciate your nuanced and attentive readership!

Avatar

hey what's up with the "!" in fandoms? i.e. "fat!" just curious thaxxx

Avatar

I have asked this myself in the past and never gotten an answer.

Maybe today will be the day we are both finally enlightened.

Avatar

It’s a way of referring to particular variations of (usually) a character — dark!Will, junkie!Sherlock, et cetera. I have suspected for a while that it originated from some archive system that didn’t accommodate spaces in its tags, so to make common interpretations/versions of the characters searchable, people started jamming the words together with an infix.

(Lately I’ve seen people use the ! notation when the suffix isn’t the full name, but is actually the second part of a common fandom portmanteau. This bothers me a lot but it happens, so it’s worth being aware of.)

Avatar

“Bang paths” (! is called a “bang"when not used for emphasis) were the first addressing scheme for email, before modern automatic routing was set up. If you wanted to write a mail to the Steve here in Engineering, you just wrote “Steve” in the to: field and the computer sent it to the local account named Steve. But if it was Steve over in the physics department you wrote it to phys!Steve; the computer sent it to the “phys” computer, which sent it in turn to the Steve account. To get Steve in the Art department over at NYU, you wrote NYU!art!Steve- your computer sends it to the NYU gateway computer sends it to the “art” computer sends it to the Steve account. Etc. (“Bang"s were just chosen because they were on the keyboard, not too visually noisy, and not used for a huge lot already).

It became pretty standard jargon, as I understand, to disambiguate when writing to other humans. First phys!Steve vs the Steve right next to you, just like you were taking to the machine, then getting looser (as jargon does) to reference, say, bearded!Steve vs bald!Steve.

So I’m guessing alternate character version tags probably came from that.

100% born of bang paths. fandom has be floating around on the internet for six seconds longer than there has been an internet so early users just used the jargon associated with the medium and since it’s a handy shorthand, we keep it.

Neat!  I have always wondered this!

It was probably adopted because early tagging iterations (on livejournal, mostly) used the space as its separator. So Bald Steve would register as “bald” “steve"—somewhat less useful than the bang-pathed “Bald!Steve”.

a lot of prescriptive linguists (the fancy term for snotty english majors, faux-talgic baby boomers, racist gatekeepers, and other subdivisions of the language police) like to shame The Youth for lazy capitalization and punctuation, but the interesting thing is that most young ppl i know who build their lives around texting are actually pretty damn deliberate about their language choices

“u” and “you” show degrees of closeness w/ your partner; using punctuation at the end of a reply text indicates tone; capitalizing certain words in the middle of the sentence is for Emphasis; sometimes we’re sloppy and sometimes we make mistakes but there is a real grammar to internet communication because by “grammar” i mean a “code of language rules that society agrees upon in order to create meaning”, and that is the opposite of being lazy

(tumblr absolutely has such a grammar and you can tell when someone’s not fluent)

the old guard is passionately defending a pure linguistic territory that we don’t want anymore, it’s not useful enough for 21st century relationships dependent on the subtleties of texts

How the World Laughs on the Web

We know that we can indicate laughter on the Internet in English by merely typing “haha”, but have you ever wondered how other languages indicate laughter in chats, text messaging and Social Media? We’ve compiled a list, just for you!

  • English – “hahaha”, “LOL”
  • Spanish – “jajaja”
  • Arabic – “ههههه” (“hhhhh” – Arabic doesn’t write short vowels, so that could be read as “hahahahaha”)
  • Thai – “55555″ (“5″ in Thai is pronounced “ha”)
  • French – “hahaha”, “héhéhé”
  • Russian – “хахаха” (“hahaha”), “бгггггг” (“bgggg”), “гггггг” (“gggggg”), “олололо” (“olololo”)
  • Ukrainian – “бгггггг” (“bhhhh”), “гггггг” (“hhhhhh”)
  • Catalan – “hahaha”
  • Portuguese – “hahaha”, “hashuashuashuashua”, “rá!”, “kkkkk”, “rsrsrs”
  • Korean – “ㅋㅋ” (“kk”), “ㅎㅎㅎ” (“hhh”)
  • Japanese – “wwww”, “ふふふ” (“huhuhu”)
  • Mandarin – “哈哈哈哈哈” (“hahahahaha”), “呵呵呵呵呵” (“hehehehehe”)
  • Indonesian – “wkwkwkwk”
  • Swedish – “hahaha”, “hehehe”, “hihihi”
  • Norwegian – “hæhæhæ”, “høhøhø”
  • Vietnamese – “hihihi”
Source: voxy.com