RSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 14 days ago

A $196 fine-tuned 7B model outperforms OpenAI o3 on document extraction

3

10

A $196 fine-tuned 7B model outperforms OpenAI o3 on document extraction

RSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 14 days ago

3

Extract-0: A Specialized Language Model for Document Information Extraction

This paper presents Extract-0, a 7-billion parameter language model specifically optimized for document information extraction that achieves performance exceeding models with parameter counts several orders of magnitude larger. Through a novel combination of synthetic data generation, supervised fine-tuning with Low-Rank Adaptation (LoRA), and reinforcement learning via Group Relative Policy Optimization (GRPO), Extract-0 achieves a mean reward of 0.573 on a benchmark of 1,000 diverse document extraction tasks, outperforming GPT-4.1 (0.457), o3 (0.464), and GPT-4.1-2025 (0.459). The training methodology employs a memory-preserving synthetic data generation pipeline that produces 280,128 training examples from diverse document sources, followed by parameterefficient fine-tuning that modifies only 0.53% of model weights (40.4M out of 7.66B parameters). The reinforcement learning phase introduces a novel semantic similarity-based reward function that handles the inherent ambiguity in information extraction tasks. This research demonstrates that task-specific optimization can yield models that surpass general-purpose systems while requiring substantially fewer computational resource.

Chat

Dionysus@leminal.space
link
fedilink
English
arrow-up
1·
13 days ago
And deepseek is based on llama, more than six figures.

I’m not aware of any larger parameter LLMs not based on one which is absurdly expensive.
- mindbleach@sh.itjust.works
  link
  fedilink
  English
  arrow-up
  1·
  13 days ago
  DeepSeek is trained from-scratch. Only some variants used other LLMs.
  
  This is a megaphone made from string, a squirrel, and a megaphone.

Hacker News@lemmy.bestiver.se

hackernews@lemmy.bestiver.se

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !hackernews@lemmy.bestiver.se

Community locked: only moderators can create posts. You can still comment on posts.

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

322 users / day
1.71K users / week
3.89K users / month
9.48K users / 6 months
1 local subscriber
2.8K subscribers
23.1K Posts
11.6K Comments
Modlog