arXiv

AutoTail-BSFGM: Class-Balance-Aware Fine-Tuning for Chinese Scholarly Text Classification

Title: AutoTail-BSFGM: Class-Balance-Aware Fine-Tuning for Chinese Scholarly Text Classification

Abstract:

While scholarly text classification facilitates literature organization, subject indexing, and research intelligence, Chinese scholarly corpora frequently suffer from imbalanced data and semantically adjacent disciplinary labels. To address these challenges, we introduce AutoTail-BSFGM, a class-balance-aware fine-tuning approach. This method integrates an automatically gated tail-prior adjustment, a weak Balanced Softmax auxiliary loss, and Fast Gradient Method (FGM) adversarial regularization. Notably, AutoTail-BSFGM modifies only the training objective and procedure; during inference, it utilizes the same single base-size encoder and linear classifier as the corresponding label-smoothed baseline.

We assessed the proposed method on two tasks derived from the CSL dataset: an abstract-to-discipline classification involving 67 labels, and a title-to-category task comprising 13 categories. For the primary abstract task, AutoTail-BSFGM enhanced both validation and lockbox accuracy when applied to Chinese RoBERTa-WWM and MacBERT-base models. Specifically, using MacBERT-base, validation accuracy rose by 0.83 percentage points and lockbox accuracy by 0.49 points, with a pooled paired McNemar test indicating statistical significance on validation (p = 0.023). In the title-to-category task, the method increased validation accuracy by 0.70 points and validation balanced accuracy by 2.64 points. While lockbox accuracy remained approximately neutral, lockbox balanced accuracy improved by 1.22 points. These findings indicate a bounded contribution: AutoTail-BSFGM enhances class-balance-sensitive performance and delivers consistent improvements for abstract-based scholarly classification, although it does not uniformly boost every metric across all splits.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TechCrunch

The world’s largest privately owned laser just turned on

Xcimer Energy activated the Phoenix laser, the world’s largest privately owned laser, aiming to commercialize fusion pow...

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya
Bloomberg

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya

Uber plans to double its electric motorcycle fleet in Kenya. This expansion aims to enhance sustainable transport option...

AI Saves Time But Most Companies Waste the Gain, Study Shows
Bloomberg

AI Saves Time But Most Companies Waste the Gain, Study Shows

A study reveals that while AI saves employee time, most companies fail to capitalize on these gains, squandering potenti...

JPMorgan Lifts S&P Target on Earnings 'Supercycle'
Bloomberg

JPMorgan Lifts S&P Target on Earnings 'Supercycle'

JPMorgan raised its S&P 500 target, citing an earnings “supercycle” that reflects heightened confidence in corporate pro...

Europe Sleepwalking Into Economic Ruin, Serb Leader Says
Bloomberg

Europe Sleepwalking Into Economic Ruin, Serb Leader Says

Serbian leader warns Europe is sleepwalking into economic ruin.

Delta Electronics Flags Power Crunch
Bloomberg

Delta Electronics Flags Power Crunch

Delta Electronics warns of a looming power deficit due to surging demand and constrained production, predicting serious ...