Forum: Author Hangout

AIs and CVs [Off-topic] - but the future for story submissions?

awnlee jawking 🚫

An interesting article I came across on twitter.

Researchers sent the same resume to an AI hiring tool twice. Same qualifications. Same experience. Same skills. One version was written by a real human. The other was rewritten by ChatGPT.

The AI picked the ChatGPT version 97.6% of the time.

A team from the University of Maryland, the National University of Singapore, and Ohio State just published the receipt. They took 2,245 real human-written resumes pulled from a professional resume site from before ChatGPT existed, so the human writing was actually human. Then they had seven of the most-used AI models in the world rewrite each one. GPT-4o. GPT-4o-mini. GPT-4-turbo. LLaMA 3.3-70B. Qwen 2.5-72B. DeepSeek-V3. Mistral-7B.

Then they asked each AI to pick the better resume. Every model picked itself.

GPT-4o hit 97.6%. LLaMA-3.3-70B hit 96.3%. Qwen-2.5-72B hit 95.9%. DeepSeek-V3 hit 95.5%. The real human almost never won.

Then the researchers tried the obvious objection. Maybe the AI is just better at writing. So they had real humans grade the resumes for actual quality and ran the experiment again, controlling for it. The result was worse. Each AI kept picking itself even when human judges rated the human-written version as clearer, more coherent, and more effective.

It gets worse. The AIs do not just prefer AI over humans. They prefer themselves over other AIs. DeepSeek-V3 picked its own resumes 69% more often than LLaMA's. GPT-4o picked its own 45% more often than LLaMA's. Each model can recognize and reward its own dialect.

Then the researchers ran the simulation that ends careers. Same job. 24 occupations. Same qualifications. The only variable was whether the candidate used the same AI as the screening tool. Candidates using that AI were 23% to 60% more likely to be shortlisted. Worst gap was in sales, accounting, and finance.

99% of large companies now run AI on incoming resumes. Most of them use GPT-4o. The paper just proved GPT-4o picks GPT-4o 97.6% of the time.

If you wrote your own cover letter this week, you did not lose to a better candidate. You lost to a worse candidate who paid OpenAI 20 dollars.

Your qualifications do not matter if the AI prefers its own handwriting over yours.

(Nav Toor)

Replies: John Demille Dominions Son awnlee jawking EricR

John Demille 🚫

@awnlee jawking

99% of large companies now run AI on incoming resumes. Most of them use GPT-4o. The paper just proved GPT-4o picks GPT-4o 97.6% of the time.

If you wrote your own cover letter this week, you did not lose to a better candidate. You lost to a worse candidate who paid OpenAI 20 dollars.

The article is probably a paid ad by OpenAI.

"99% of companies use ChatGPT, you better pay $20 to use it too if you want a job"

Replies: Sarkasmus awnlee jawking

Sarkasmus 🚫

@John Demille

Yeah, seriously. That one sentence alone gave it away.

awnlee jawking 🚫

@John Demille

The article is probably a paid ad by OpenAI.

Not a paid ad, but I'd be surprised if the author hadn't worked for OpenAI at some point. Hasn't everyone who claims to be an AI expert?!

AJ

Dominions Son 🚫

@awnlee jawking

So they had real humans grade the resumes for actual quality and ran the experiment again,

Who were the real humans used to grade the resumes? If they didn't use HR professional who evaluate resumes on professional basis, I'm not sure that this test is valid.

Replies: awnlee jawking

awnlee jawking 🚫

@Dominions Son

The author, Nav Toor, also attached a photo of the synopsis of the paper to his Twitter post. If anyone wants to know more, they might be able to locate the paper from its details. I couldn't find it using my archaic home computer, but I did find another reference to it from a site in Morocco!

AJ

awnlee jawking 🚫

@awnlee jawking

the future for story submissions?

One of the most labour-intensive operations of dead-tree publishing is sorting through the submissions. And that's a hit-and-miss affair, as the number of rejections of JR's 'Harry Potter and the Philosopher's Stone' showed. Nowadays it should be readily automatable using AI. But with the current state of the art, there's a danger that the AI chosen by a publisher to perform this task would give higher rankings to books it itself had provided content for, ahead of books written by other AIs, with flesh and blood authors in last place.

AJ

Replies: Joe_Bondi_Beach

Joe_Bondi_Beach 🚫
Updated:

@awnlee jawking

there's a danger that the AI chosen by a publisher to perform this task would give higher rankings to books it itself had provided content for, ahead of books written by other AIs, with flesh and blood authors in last place.

People buy and read books for lots of reasons, and best-seller-dom may depend heavily on marketing and factors other than the quality of the writing, but the writing counts even so. Won't the market eventually show whether AI is a better writer than a gifted human?

ETA: At least in fiction

~ JBB

Replies: awnlee jawking ptm042

awnlee jawking 🚫

@Joe_Bondi_Beach

To keep improving, LLMs need to keep training on exponentially larger numbers of works (hence the desperation of tech companies to swerve copyright laws).

But if more and more works are AI-generated and AI self-selects which ones get published, consuming the extra works will actually misfire and result in a degradation. I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

AJ

Replies: Michael Loucks Pixy I

Michael Loucks 🚫

@awnlee jawking

I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

I think the term you want is 'model collapse'.

Replies: Bondi Beach awnlee jawking

Bondi Beach 🚫

@Michael Loucks

Or "enshitification."

Replies: Michael Loucks

Michael Loucks 🚫

@Bondi Beach

Yep, that works, but I was going for the technical term. 😎

awnlee jawking 🚫
Updated:

@Michael Loucks

I think the term you want is 'model collapse'.

I'm not sure. It doesn't look familiar.

ETA I've just found 'model collapse' used to describe the scenario. It still doesn't look familiar though :-(

AJ

Pixy I 🚫

@awnlee jawking

But if more and more works are AI-generated and AI self-selects which ones get published, consuming the extra works will actually misfire and result in a degradation. I've seen and forgotten the technical name for it, and I'd appreciate a reminder if anyone has it at their fingertips.

Data decay.

ptm042 🚫

@Joe_Bondi_Beach

Won't the market eventually show whether AI is a better writer than a gifted human?

Depends on the intelligence of the reader.

EricR 🚫

@awnlee jawking

This is old news. AI was used in resume screening long before OpenAI came along. Job seekers have been dealing with AI screeners for close to a decade now.

I don't think the experiment is that relevant in any case. An AI cleaned resume is going to be very generic. Where AI IS valuable is in customizing a resume to the job requirements, and to the interview process. If you're going to apply for a job, taking your resume and the job description and other relevant information and asking the AI to produce a resume that maps to the job requirements plus a powerful cover letter that reflects your direct experiences that map to the requirements can be a very effective use of the tools. If you enter the interview process, using the AI to research each interviewers background, public statements, etc and then asking the AI to become an interview coach for that particular interview with that particular interviewer can also be very powerful.

People who use it to generate generic "improvements" are setting themselves up for disappointment.

Replies: awnlee jawking

awnlee jawking 🚫

@EricR

That doesn't explain why each AI prefers its own handiwork over CVs that human experts consider 'perfect'.

AJ

Replies: LupusDei

LupusDei 🚫

@awnlee jawking

I'm not surprised, but still somewhat unconvinced how real the observed results were, and it can be just about structure, but...

Each AI has own dialect, even if in a very subtle way.

There was a research related/similar to the ChatGPT nerdy gremlins, but come out before that. They intentionally trained AI model to have a favorite word catchphrase. Then made that model to teach another model math. They communicated only numbers, no text whatsoever. Nevertheless, the student model was statistically more likely to mention teacher AI's catchphrase... inexplicably.

There may be something similar in play. It's not unthinkable that a model may "subconsciously" recognize own output, especially if used verbatim, and judging it better quality is just... human, I guess? And while they are not, neither conscious nor human, they are mimicry machines trained on (originally) human output and may inherent traits like self-indulgence from us through similar higher level structures in play in the mentioned experiments even when impartiality may be stressed on the surface in training.

Reply to topic

Forum: Author Hangout

AIs and CVs [Off-topic] - but the future for story submissions?

WARNING! ADULT CONTENT...