[GSoC] Train DeBERTa NER model for required phrase prediction #5137

Continuation of #5077. Uses the BIOES dataset produced there.

## Goal
Fine-tune DeBERTa-v3-large to predict required phrase spans in license rule text using BIOES labels.
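
For reference, a minimal sketch of how phrase spans map to BIOES tags (B = begin, I = inside, E = end, S = single-token span, O = outside). The helper name `spans_to_bioes`, the inclusive token-index span format, and the example phrases are assumptions made for illustration. The dataset from #5077 may use a different representation.

```python
from typing import List, Tuple


def spans_to_bioes(num_tokens: int, spans: List[Tuple[int, int]]) -> List[str]:
    """Tag tokens with BIOES given inclusive (start, end) token-index spans.

    Hypothetical helper: spans are assumed to be non-overlapping.
    """
    tags = ["O"] * num_tokens
    for start, end in spans:
        if start == end:
            tags[start] = "S"          # single-token span
        else:
            tags[start] = "B"          # first token of the span
            for i in range(start + 1, end):
                tags[i] = "I"          # interior tokens
            tags[end] = "E"            # last token of the span
    return tags


tokens = "Redistributions of source code must retain the above copyright notice".split()
# Illustrative required phrases: "source code" (tokens 2-3) and "copyright" (token 8)
tags = spans_to_bioes(len(tokens), [(2, 3), (8, 8)])
for token, tag in zip(tokens, tags):
    print(f"{tag}\t{token}")
```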

## Tasks
- Subword tokenization with label alignment (-100 for continuation tokens, so the loss ignores them)
- Training with a class-weighted loss to handle the O vs. B/I/E/S imbalance
- Include negative samples (rules without markers, all-O labels) for balanced training
- Evaluation: token-level F1 and exact span match
- ONNX export for CPU inference
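
A rough sketch of two of the tasks above: label alignment and exact span match scoring. It uses plain Python so it does not depend on any training setup. The label ids, the helper names (`align_labels`, `bioes_spans`, `span_f1`), and the sample inputs are assumptions for illustration. `word_ids` is assumed to have the shape returned by `word_ids()` on a Hugging Face fast tokenizer encoding: one entry per subword token, `None` for special tokens. This sketch also assigns -100 to special tokens, which the task list does not spell out.

```python
from typing import List, Optional, Set, Tuple

IGNORE_INDEX = -100  # default ignore_index of PyTorch's CrossEntropyLoss


def align_labels(word_ids: List[Optional[int]], word_labels: List[int]) -> List[int]:
    """Give each word's label to its first subword; mask everything else."""
    aligned = []
    previous = None
    for wid in word_ids:
        if wid is None:
            aligned.append(IGNORE_INDEX)       # special token, e.g. [CLS] / [SEP]
        elif wid != previous:
            aligned.append(word_labels[wid])   # first subword of a word
        else:
            aligned.append(IGNORE_INDEX)       # continuation subword
        previous = wid
    return aligned


def bioes_spans(tags: List[str]) -> Set[Tuple[int, int]]:
    """Decode BIOES tags into inclusive (start, end) spans; malformed runs are dropped."""
    spans = set()
    start = None
    for i, tag in enumerate(tags):
        if tag == "S":
            spans.add((i, i))
            start = None
        elif tag == "B":
            start = i
        elif tag == "E":
            if start is not None:
                spans.add((start, i))
            start = None
        elif tag == "O":
            start = None
        # "I" leaves an open span untouched
    return spans


def span_f1(gold_tags: List[str], pred_tags: List[str]) -> float:
    """F1 over spans, counting a prediction as correct only on an exact boundary match."""
    gold, pred = bioes_spans(gold_tags), bioes_spans(pred_tags)
    true_positives = len(gold & pred)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Assumed label ids: O=0, B=1, I=2, E=3, S=4
aligned = align_labels([None, 0, 1, 1, 2, None], [0, 4, 0])
score = span_f1(["O", "B", "I", "E", "O", "S"], ["O", "B", "I", "E", "O", "O"])
print(aligned, round(score, 3))
```

Here the predicted tags recover one of the two gold spans exactly, so precision is 1.0 and recall is 0.5.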

