New paper · SkillAudit: From Fixed-Suite Benchmarking to Skill-Centered Assessment

MMPCBench

Type
Benchmark
Year
2026
Status
active

A benchmark for measuring how multimodal language models reconstruct missing product text or imagery and support recommendation.

MMPCBench evaluates missing-modality completion in product catalogues, where absent images or descriptions can weaken both product presentation and downstream recommendation.

The benchmark contains a content-quality track for generated modalities and a recommendation track for measuring whether completed product representations remain useful in ranking systems.