Home ยป Forum ยป Author Hangout

Forum: Author Hangout

AIs and CVs [Off-topic] - but the future for story submissions?

awnlee jawking ๐Ÿšซ

An interesting article I came across on twitter.

Researchers sent the same resume to an AI hiring tool twice. Same qualifications. Same experience. Same skills. One version was written by a real human. The other was rewritten by ChatGPT.

The AI picked the ChatGPT version 97.6% of the time.

A team from the University of Maryland, the National University of Singapore, and Ohio State just published the receipt. They took 2,245 real human-written resumes pulled from a professional resume site from before ChatGPT existed, so the human writing was actually human. Then they had seven of the most-used AI models in the world rewrite each one. GPT-4o. GPT-4o-mini. GPT-4-turbo. LLaMA 3.3-70B. Qwen 2.5-72B. DeepSeek-V3. Mistral-7B.

Then they asked each AI to pick the better resume. Every model picked itself.

GPT-4o hit 97.6%. LLaMA-3.3-70B hit 96.3%. Qwen-2.5-72B hit 95.9%. DeepSeek-V3 hit 95.5%. The real human almost never won.

Then the researchers tried the obvious objection. Maybe the AI is just better at writing. So they had real humans grade the resumes for actual quality and ran the experiment again, controlling for it. The result was worse. Each AI kept picking itself even when human judges rated the human-written version as clearer, more coherent, and more effective.

It gets worse. The AIs do not just prefer AI over humans. They prefer themselves over other AIs. DeepSeek-V3 picked its own resumes 69% more often than LLaMA's. GPT-4o picked its own 45% more often than LLaMA's. Each model can recognize and reward its own dialect.

Then the researchers ran the simulation that ends careers. Same job. 24 occupations. Same qualifications. The only variable was whether the candidate used the same AI as the screening tool. Candidates using that AI were 23% to 60% more likely to be shortlisted. Worst gap was in sales, accounting, and finance.

99% of large companies now run AI on incoming resumes. Most of them use GPT-4o. The paper just proved GPT-4o picks GPT-4o 97.6% of the time.

If you wrote your own cover letter this week, you did not lose to a better candidate. You lost to a worse candidate who paid OpenAI 20 dollars.

Your qualifications do not matter if the AI prefers its own handwriting over yours.

(Nav Toor)

John Demille ๐Ÿšซ

@awnlee jawking

99% of large companies now run AI on incoming resumes. Most of them use GPT-4o. The paper just proved GPT-4o picks GPT-4o 97.6% of the time.

If you wrote your own cover letter this week, you did not lose to a better candidate. You lost to a worse candidate who paid OpenAI 20 dollars.

The article is probably a paid ad by OpenAI.

"99% of companies use ChatGPT, you better pay $20 to use it too if you want a job"

Sarkasmus ๐Ÿšซ

@John Demille

Yeah, seriously. That one sentence alone gave it away.

awnlee jawking ๐Ÿšซ

@John Demille

The article is probably a paid ad by OpenAI.

Not a paid ad, but I'd be surprised if the author hadn't worked for OpenAI at some point. Hasn't everyone who claims to be an AI expert?!

AJ

Dominions Son ๐Ÿšซ

@awnlee jawking

So they had real humans grade the resumes for actual quality and ran the experiment again,

Who were the real humans used to grade the resumes? If they didn't use HR professional who evaluate resumes on professional basis, I'm not sure that this test is valid.

Replies:   awnlee jawking
awnlee jawking ๐Ÿšซ

@Dominions Son

The author, Nav Toor, also attached a photo of the synopsis of the paper to his Twitter post. If anyone wants to know more, they might be able to locate the paper from its details. I couldn't find it using my archaic home computer, but I did find another reference to it from a site in Morocco!

AJ

awnlee jawking ๐Ÿšซ

@awnlee jawking

the future for story submissions?

One of the most labour-intensive operations of dead-tree publishing is sorting through the submissions. And that's a hit-and-miss affair, as the number of rejections of JR's 'Harry Potter and the Philosopher's Stone' showed. Nowadays it should be readily automatable using AI. But with the current state of the art, there's a danger that the AI chosen by a publisher to perform this task would give higher rankings to books it itself had provided content for, ahead of books written by other AIs, with flesh and blood authors in last place.

AJ

Replies:   Joe_Bondi_Beach
Joe_Bondi_Beach ๐Ÿšซ
Updated:

@awnlee jawking

there's a danger that the AI chosen by a publisher to perform this task would give higher rankings to books it itself had provided content for, ahead of books written by other AIs, with flesh and blood authors in last place.

People buy and read books for lots of reasons, and best-seller-dom may depend heavily on marketing and factors other than the quality of the writing, but the writing counts even so. Won't the market eventually show whether AI is a better writer than a gifted human?

ETA: At least in fiction

~ JBB

Replies:   awnlee jawking  ptm042
awnlee jawking ๐Ÿšซ

@Joe_Bondi_Beach

To keep improving, LLMs need to keep training on exponentially larger numbers of works (hence the desperation of tech companies to swerve copyright laws).

But if more and more works are AI-generated and AI self-selects which ones get published, consuming the extra works will actually misfire and result in a degradation. I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

AJ

Replies:   Michael Loucks  Pixy I
Michael Loucks ๐Ÿšซ

@awnlee jawking

I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

I think the term you want is 'model collapse'.

Bondi Beach ๐Ÿšซ

@Michael Loucks

Or "enshitification."

Replies:   Michael Loucks
Michael Loucks ๐Ÿšซ

@Bondi Beach

Yep, that works, but I was going for the technical term. ๐Ÿ˜Ž

awnlee jawking ๐Ÿšซ
Updated:

@Michael Loucks

I think the term you want is 'model collapse'.

I'm not sure. It doesn't look familiar.

ETA I've just found 'model collapse' used to describe the scenario. It still doesn't look familiar though :-(

AJ

Pixy I ๐Ÿšซ

@awnlee jawking

But if more and more works are AI-generated and AI self-selects which ones get published, consuming the extra works will actually misfire and result in a degradation. I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

Data decay.

ptm042 ๐Ÿšซ

@Joe_Bondi_Beach

Won't the market eventually show whether AI is a better writer than a gifted human?

Depends on the intelligence of the reader.

Back to Top

 

WARNING! ADULT CONTENT...

Storiesonline is for adult entertainment only. By accessing this site you declare that you are of legal age and that you agree with our Terms of Service and Privacy Policy.


Log In