ArXiv Implements Year-Long Bans for AI-Generated Paper Submissions

Abstract illustration of document filtering system representing ArXiv's enforcement of AI-generated paper detection

ArXiv, the world’s largest preprint repository hosting over 2.4 million research papers, has begun enforcing year-long submission bans against researchers who attempt to upload AI-generated manuscripts, marking one of the most aggressive institutional responses yet to the proliferation of machine-generated academic content.

The Cornell University-operated platform, which processes more than 200,000 submissions annually across physics, mathematics, computer science, and related fields, confirmed the policy enforcement this week following multiple reports of researchers receiving extended bans after submitting papers flagged as predominantly machine-generated.

The crackdown addresses mounting concerns about what platform moderators have termed “AI slop”—low-quality, machine-generated text that lacks the rigour and originality expected of genuine research contributions. ArXiv’s moderation team has reportedly identified a significant uptick in submissions bearing telltale signs of large language model generation, including repetitive phrasing, hallucinated citations, and logical inconsistencies that human authors would typically catch.

Unlike many academic publishers still developing their AI content policies, ArXiv has moved directly to enforcement. The year-long ban represents a substantial penalty in fast-moving fields where researchers routinely share preliminary findings on the platform before formal peer review. For early-career scientists and those in competitive subfields, a 12-month exclusion from ArXiv could materially impact their ability to establish priority for discoveries and maintain visibility within their research communities.

The policy does not prohibit all AI assistance in research writing. ArXiv’s guidelines permit the use of language models for editing, grammar correction, and translation—tasks that enhance rather than replace human intellectual contribution. The enforcement targets papers where AI systems have generated substantial portions of the scientific content itself, particularly methodology descriptions, results interpretation, and literature reviews.

Business Impact

The enforcement creates clear winners and losers in the academic publishing ecosystem. Traditional publishers and peer-reviewed journals stand to benefit as researchers may opt for slower but safer publication routes rather than risk ArXiv bans that could damage their reputations. Academic integrity software vendors, including Turnitin and iThenticate, are likely to see increased demand for AI detection capabilities as institutions seek to screen submissions before they reach platforms like ArXiv.

Conversely, AI writing assistant companies targeting academic users face reputational challenges. Tools marketed for “research paper generation” or “automated literature reviews” now carry significant professional risk for users. The policy may accelerate consolidation in this market segment, favouring established players like Grammarly that position themselves as editing aids rather than content generators.

For research institutions, the policy introduces new compliance considerations. Universities may need to implement pre-submission screening processes and provide clearer guidance to researchers about acceptable AI use—adding administrative overhead but potentially preventing career-damaging violations.

Enforcement Challenges

ArXiv’s moderation team faces technical limitations in detecting sophisticated AI-generated content. Current detection tools produce false positives, particularly for non-native English speakers whose writing patterns may resemble machine output. The platform has not disclosed its detection methodology, likely to prevent gaming of the system, but this opacity raises due process concerns for researchers who believe they have been incorrectly flagged.

The policy also creates incentives for researchers to develop detection-resistant AI writing tools, potentially triggering an arms race between content generators and platform moderators. Some AI companies have already begun marketing “undetectable” academic writing assistance, though the effectiveness and ethics of such tools remain contested.

What to Watch

Other major preprint servers, including bioRxiv and medRxiv, are expected to announce their enforcement approaches in coming months. The research community will be monitoring whether ArXiv’s strict stance becomes the standard or whether alternative platforms adopt more permissive policies that could fragment the preprint ecosystem.

Additionally, professional societies and funding bodies have yet to weigh in definitively on AI-assisted research writing. Their positions will likely shape institutional policies and determine whether ArXiv’s approach represents the beginning of sector-wide standards or an outlier position that researchers can circumvent by choosing alternative venues.

ArXiv’s enforcement marks a definitive end to the ambiguity surrounding AI-generated academic content, establishing clear professional consequences that extend beyond individual papers to researchers’ long-term publishing capabilities.