Accepted at ECCV 2026

Fine-grained recognition · Dataset distillation

Distill the details
that make a difference.

FD² preserves subtle, localized cues while building a compact synthetic dataset—turning fine-grained variation from a distillation obstacle into a learning signal.

Read the paper ↗ View on GitHub ↗

FrameworkDecoupled DD

FocusLocalized cues

VenueECCV 2026

Hongxu Ma^1,†· Guang Li^2,†,*· Shijie Wang³· Dongzhan Zhou⁴· Baoli Sun⁵· Takahiro Ogawa²· Miki Haseyama²· Zhihui Wang⁵

¹ Zhejiang University ² Hokkaido University ³ The University of Queensland ⁴ Shanghai AI Laboratory ⁵ Dalian University of Technology

† Equal contribution * Correspondence to Guang Li (guang@lmd.ist.hokudai.ac.jp)

01 / Motivation

Fine-grained data asks a harder question.

Conventional decoupled distillation is guided by coarse class labels. On fine-grained datasets, that can blur the tiny distinctions between classes—and make distilled images within a class too alike.

A bird is not just a bird. The answer may live in the wing, the beak, or a patch of color.

Conventional DDCoarse

Same-class collapse

Samples inherit nearly identical optimization signals, limiting local diversity.

FD²Fine-grained

Discriminative diversity

Class prototypes guide identity while attention constraints preserve distinct cues.

02 / Framework

One framework.
Three deliberate stages.

FD² slots into the established decoupled dataset-distillation pipeline without rewriting the recipe.

Model pretraining

Counterfactual attention learning discovers discriminative regions and aggregates their representations into class prototypes.

Build prototypes

Sample distillation

A characteristic constraint pulls each sample toward its own prototype and away from others. A similarity constraint diversifies same-class attention.

Distill details

Soft-label generation

The pretrained fine-grained model converts distilled samples into informative soft labels for efficient downstream training.

Transfer knowledge

FD squared framework: model pretraining, sample distillation, and soft-label generation — FD² augments a decoupled distillation pipeline with counterfactual attention learning (CAL), fine-grained characteristic alignment, and same-class attention diversity.

03 / Design

Identity,
without imitation.

Two complementary objectives keep the synthetic set faithful to its class and meaningfully varied within it.

ℒ_F

Fine-grained characteristic

Be closer to your class.

Align each representation with its class prototype while repelling prototypes from other classes.

ℒ_S

Attention similarity

Look somewhere different.

Reduce overlap between current and previous attention maps from the same class.

04 / Takeaway

The result

A compact dataset that remembers where to look.

Across fine-grained and general datasets, FD² integrates seamlessly with decoupled dataset distillation and improves performance in most settings—showing strong transferability.

Fine-grained awarePlug-and-playTransferable

05 / Citation

Build on the details.

If FD² supports your research, please cite our paper.

arXiv ↗ GitHub ↗

@inproceedings{ma2026fd2,
  title   = {FD$^2$: A Dedicated Framework for
             Fine-Grained Dataset Distillation},
  author  = {Ma, Hongxu and Li, Guang and Wang, Shijie
             and Zhou, Dongzhan and Sun, Baoli and
             Ogawa, Takahiro and Haseyama, Miki and
             Wang, Zhihui},
  booktitle = {European Conference on Computer Vision (ECCV)},
  year    = {2026}
}

Distill the detailsthat make a difference.