Hacker News

Human writers have always used the em dash

124 points by FromTheArchives

by AlecSchueler

6 subcomments

This isn't really convincing.
They say the models were trained on a bunch of books and that they learned the use of the dash from there. That's fine, no one is denying that humans have always used dashes in their books.
But where you would rarely see a dash is in something like a short product review, a YouTube comment or a WhatsApp message. In these contexts the dashes can and do seem out of place.

by mv4

2 subcomments

I am fairly confident the majority of my LinkedIn network are not experienced writers and don't know what em dash means. All make regular posts with em dashes in them. Their excessive use, combined with a certain presentation style, tells me it's ChatGPT. When I ask them they confirm it's ChatGPT.

by dragonwriter

0 subcomment

Rather, human typesetters of professionally printed material have always (well, since it was invented) used the em-dash. Handwritten dashes rarely clearly break down into categories that are clearly exactly matching one or the other typographical dash, and until relatively recently with proportional display fonts with large character sets and fancy input methods, typed (whether on typewriter or computer) text was unlikely to directly contain typographical dashes, though some systems (especially publishing/typesetting toolchains!) had system-specific, ad hoc means of representing them.
OTOH, as long as user-interactive web content has existed—so “always” in a context of a particular view of the online world—em-dashes have been part of it, because the facilities that make it easy to use (whether automatic replacement, or various keyboard input modifying mechanisms) have been sufficiently common that a robust minority of users have regularly used one or more of them.

by lapcat

0 subcomment

The article title is actually "Stop AI-Shaming Our Precious, Kindly Em Dashes—Please". The HN submission title is the subtitle.

by euroderf

3 subcomments

> I speak of the elegant, elongated hyphen, the gentle friend and ally of all writers, used to set off a chunk of text within a sentence.
There's nothing elegant about a punctuation mark firmly glued to the words on either side, making a sequoia-sized typographic log that typically gets wrapped in its entirety to the next line, leaving a half mile or so of white space just hanging in space before the wrap.
If you're gonna use the em dash, make sure your software can break a line on either side of one.

by K0balt

1 subcomments

Long live the em-dash!
I frequently am accused of using LLMs to write my prose, something that I not only eschew, but also believe is morally corrupt and intellectually dishonest.
I’m not above spellcheck, grammar checkers, or even LLM driven evaluation of articles, but my thoughts, word choices, and structure are always of my own design.
I use the em-dash where it is appropriate.
I find that people accusing writers of using AI typically disagree with the premise of the text, and use the “AI” character assault as a method of dehumanising the author and dismissing their work. The assertion is very rarely made in good faith, but rather is used as a weak attempt to discredit an idea without actually refuting the premise or even examining the argument.
Shame on whoever argues in this way, it’s weak, unproductive, and intellectually lazy. It’s fine to disagree, but if you aren’t willing to act in good faith, just keep your thoughts to yourself. You’re only going to discredit your own point of view if you touch the keyboard.

by CaptWillard

0 subcomment

That's exactly what sentient AI would like us to believe.

by k__

2 subcomments

As a professional writer, I can confirm that my editors love to sprinkle em dashes excessively on my work.
Personally, I'm more prone to excessive semicolon usage, which seems to aggravate editors.

by tokai

2 subcomments

I'm just happy that LLMs don't seem particularly fond of semicolons; their use should be reserved for the daring trailblazers that carve out their own path.

by gizajob

0 subcomment

Human writers with university degrees and who conform to the style guides where they’re free to use it.
Newspapers generally avoid it, even avoiding it completely in favour of commas. Properly wielding the n-dash or the m-dash requires training.

by antiloper

0 subcomment

Writers have used the em dash for centuries, certain members of internet forums and chatrooms have used them for two years. It's a tell.

by podgorniy

0 subcomment

In 2008-ish I was into web typography, if you may say so. We used special tools like https://www.artlebedev.ru/typograf/ to make text look clean according to typographic conventions. That included m-dashes. Amazing to see this subject surfacing again.

by currency

1 subcomments

I use em dashes constantly.
I've been a Mac user for years, where the em dash is a modified hyphen on the Mac keyboard. When I moved to primarily using PCs, the em dash alt-key combo was the first one I memorized (alt-0151).
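For reference, the alt-0151 code mentioned above lines up with byte 151 of the Windows-1252 code page, which maps to U+2014. A minimal Python sketch, assuming the alt code is interpreted in Windows-1252 (the variable names are illustrative only):

```python
# Alt+0151 is assumed here to select position 151 of the
# Windows-1252 code page; decode that single byte to see the character.
alt_code = 151
char = bytes([alt_code]).decode("cp1252")

# Format the resulting code point in the usual U+XXXX notation.
codepoint = f"U+{ord(char):04X}"
print(char, codepoint)
```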

by yung_steezy

0 subcomment

I use the em-dash quite often but tend to forget that I need to hit `-` twice to get it to appear in markdown. Used to be an oversight on my part but now I stick to it so people can tell I'm personally writing to them.

by serbuvlad

3 subcomments

Yes, people use the em dash. The point isn't the em dash itself. It's about U+2014. Yeah, in a book, or maybe a quality article, you'd type the em dash properly. But most of the time online? I write it as - or as --.
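To make the U+2014 distinction concrete, here is a small Python sketch that prints the code points of the three characters usually lumped together as "a dash" and counts literal U+2014 occurrences in a string. The helper names `describe` and `count_em_dashes` are made up for illustration; this is not a detector of anything, just a look at the characters.

```python
import unicodedata

# The three characters commonly conflated in casual typing.
candidates = {
    "hyphen-minus": "\u002d",  # the key on a standard keyboard
    "en dash": "\u2013",
    "em dash": "\u2014",
}

def describe(ch: str) -> str:
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"

def count_em_dashes(text: str) -> int:
    """Count literal U+2014 characters; '-' and '--' are not counted."""
    return text.count("\u2014")

for label, ch in candidates.items():
    print(label, "->", describe(ch))
```

A string typed as `a -- b` contains zero U+2014 characters under this count, which is the distinction the comment is drawing.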

by cainxinth

1 subcomments

LLMs have also made the word “crucial” suspect. They use that one constantly.

by SkyeCA

4 subcomments

Normal people (myself included) are not particularly good at writing and would never use an emdash. The average person won't even use semicolons because of confusion about how to use them and at least those have a dedicated key.
I'm sorry to the professional writers out there, but if I see an emdash in a piece of throw away writing (like a reddit or HN comment) I assume it's AI generated and I now immediately stop reading it.

by khernandezrt

0 subcomment

I’ve never used an em dash in my life—but after having AI rewrite a lot of my emails I’m starting to use it more often, though incorrectly most of the time.

by Philadelphia

0 subcomment

We were taught to use em dashes as reporters 25 years ago, and I use them all the time in personal writing.

by foxyv

1 subcomments

The only time I used the em dash was when Microsoft Word used to automatically add it to something I was typing. Usually it was formally typed stuff like essays and reports. I have never in my life used an em dash for anything else. Usually just a hyphen "-" at most.

by 1970-01-01

0 subcomment

Humans also have great grammar, spelling and punctuate they're sentences corectally

by FireBeyond

1 subcomments

I use the em dash as appropriate, similar to semicolons and their ilk.
I don't think use of an em dash is indicative in itself of AI assistance, but rather, the change to using them. Did this person all of a sudden start using them? There are also other things to look at, like how certain bullet point lists have emphasis (for key phrases, being bold, when previously the author didn't do so, stylistically).
I write a lot (as a PM) - I've taken to using MacWhisper, which does local AI dictation, but also (at my configuration) sends it to a ChatGPT prompt first:
"You are a professional proofreader and editor. Your task is to refine and polish the given transcript as follows:
1. Correct any spelling errors.
2. Fix grammatical mistakes.
3. Improve punctuation where necessary.
4. Ensure consistent formatting.
5. Clarify ambiguous phrasing without changing the meaning.
6. If a sentence or paragraph is overly verbose and has more than negligible redundancy, lightly edit for brevity.
7. If the transcript contains a question, edit it for clarity but do not provide an answer.
Please return only the cleaned-up version of the transcript. Do not add any explanations or comments about your edits."
This is great. I get the benefits of pretty accurate transcription while getting a first pass at copyediting almost in real time. It did require me to make some tweaks to my dictation process (allowing it to "chew" on larger chunks to give better context to its editing), but it works very well.

by Doctor_Fegg

4 subcomments

Fixed that for you: _American_ writers have always used the em dash. In British English orthography, space-en dash-space is much more common.

by musicale

0 subcomment

That's just the sort of thing an LLM would say.

by commandlinefan

0 subcomment

I never gave it much thought until I published my first book - then the editor insisted that I replace most of my parenthetical thoughts with emdash'ed inserts instead.

by teekert

2 subcomments

If you see text from me and it has an em-dash, it's 100% gen AI.

by mcv

2 subcomments

It seems to me that the article is missing the point somewhat. There's absolutely nothing wrong with the em-dash, but most people never use it (I don't think I've ever used it), because it doesn't appear on most standard keyboards.
If you encounter an em-dash in an online discussion, most likely someone went to extra effort to include it, or it was automatically inserted, possibly by an AI.
There are other signs that you're looking at AI-generated texts, like lists of three, a certain turn of phrase, or vague generalities, but those are easier for a human to type than an em-dash.

by Hilift

7 subcomments

"Point to the keys you press to enter the em dash". And smart quotes. My conjecture (and personal experience) is that 99% of the occurrences of these characters are not due to pressing the corresponding keys; they are due to copy-paste. So it should not be surprising or considered to be a personal attack on AI.

by Nevermark

0 subcomment

I use the em⸻dash.

by wavemode

0 subcomment

This article is attacking a strawman. Nobody was ever advocating for labeling all em dash usage as AI. Even the tweet they reference (not that random tweets ought to be taken as some sort of authoritative gauge of the current state of society...) does not claim that all em dash usage is AI.
In certain contexts, em dashes are perfectly natural and human. That being said, everyone has encountered articles and posts that read so obviously like AI, and in those contexts the presence of numerous em dashes is certainly an additional data point.

by globular-toast

0 subcomment

I used em dash liberally in my PhD thesis, mainly because I learnt how to do them in TeX. Thankfully that was a long time before LLMs.

by dcchambers

0 subcomment

I am a millennial and I grew up with computers. I was taught that it was grammatically proper to use dashes, not hyphens. Microsoft Word (and later, Google Docs) made this trivially easy because you could type two hyphens (--) and it would replace it with an em-dash character. I rarely write in Word or Google Docs these days, but when I do I still do that double-hyphen shortcut.
I think the main reason people are noticing it now is because most writing has moved away from legacy tools like Word. Websites like Twitter don't do that character substitution, so it has become quite obvious when text is being pasted from another place...for example, AI generated content.

by phyzome

0 subcomment

It's so stupid that this even needs to be said.
And yet here we are.

by pessimizer

1 subcomments

I have no idea how this is a real article that people are wasting their time on.
Of course people use the em-dash, and of course LLMs use them at least 10x-100x more than your average human writer. Also, they add nothing to writing: 99.8% of people just use an en-dash when typing where an em-dash would be used in print, and absolutely nothing is lost. Some dickheads (like myself) have used a compose key (or similar) to use actual em-dashes in order to seem sophisticated online.
The only people who need the em-dash, as far as I know, are Spanish-language writers. As for LLM-shaming, isn't it more shameful when you publish an article that could easily be entirely written by LLM, but definitely wasn't, like this one?
edit: articles like this make me want to misuse flagging.

by CivBase

3 subcomments

This article completely misses the point from the start.
The reason em dashes are a giveaway for AI generated text is simply because there is no em dash key on the keyboard - only an en dash key. The dash I used in that last sentence was an en dash, not an em dash.
Some publishing applications (including Microsoft Word) will automatically convert en dashes to em dashes where appropriate. But most email apps, chat apps, online posts/comments, and practically any application not designed for writing actual printed publications will not do that conversion for you. And without a dedicated key, it is far too cumbersome for most people to bother. They will just leave it as an en dash.
So yes, the em dash is still a reliable indicator of AI-generated content in many contexts.

by tolmasky

0 subcomment

I'm glad the em dash is getting properly shit on these days, if for unrelated reasons. I've never liked it. I hate the stupid spacing rules around it. It never looks right to put no spaces around the em dash, and probably breaks all sorts of word-splitting code that's based on "\s". Where else does punctuation without spaces not mean a single word? Hyphens without spaces is a compound word: it counts as one. Imagine if the correct use of a colon was to not put spaces around it:like this. Do you like that? Of course not.
But I think worst of all it just gives me the fucking creeps, some uncanny-valley bullshit. I see hyphens a million times a day then out of nowhere comes this creepy slender-man looking motherfucker that's just a little bit longer than you'd expect or like, and is always touching all the letters around it when it shouldn't need to. It stands out looking like a weird print error... on my screen! Hopefully it keeps building a worse and worse reputation.
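The word-splitting worry above is easy to check. A rough Python sketch, assuming a splitter built on `\s` and a made-up sample sentence with unspaced U+2014 characters:

```python
import re

# Hypothetical sample text with unspaced em dashes (written as escapes).
sentence = "I use it\u2014often\u2014in prose"

# Splitting on whitespace only: the em dashes stay inside one token.
naive = re.split(r"\s+", sentence)

# Treating U+2014 as a separator too recovers the individual words.
aware = [tok for tok in re.split(r"[\s\u2014]+", sentence) if tok]

print(len(naive), naive)
print(len(aware), aware)
```

With whitespace-only splitting the three words joined by dashes come back as a single token, so a word count based on `\s` would undercount here.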