Skip to main content

Task and Config Matrix

This page is not an exhaustive metric catalog. Its purpose is to map tasks to starter configs so you can begin from an existing setup instead of authoring YAML from scratch.

Object-Centric

Starter ConfigTypical taskNotes
object_centric/subject_add.yamlsubject_addAdd a subject or object
object_centric/subject_remove.yamlsubject_removeRemove a target subject
object_centric/subject_replace.yamlsubject_replaceReplace a subject
object_centric/color_alter.yamlcolor_alterModify color attributes
object_centric/material_alter.yamlmaterial_alterModify material attributes
object_centric/size_adjustment.yamlsize_adjustmentResize a target
object_centric/text_editing.yamltext_editingEdit text inside an image
object_centric/cref.yamlcrefCharacter or subject reference tasks
object_centric/oref.yamlorefObject reference tasks

Human-Centric

Starter ConfigTypical taskNotes
human_centric/ps_human.yamlps_humanPortrait or human-appearance editing
human_centric/motion_change.yamlmotion_changeHuman pose or motion editing

VLM-as-a-Judge

Starter ConfigUse caseNotes
vlm_as_a_judge/openai.yamlAPI backendOpenAI-style chat endpoint
vlm_as_a_judge/gemini.yamlGoogle-style backendGemini-compatible backend
vlm_as_a_judge/vllm.yamlSelf-hosted backendLocal or internal vLLM service
vlm_as_a_judge/eval_if.yamlBenchmark evalInstruction-following judge prompt
vlm_as_a_judge/eval_vc.yamlBenchmark evalVisual-consistency judge prompt
vlm_as_a_judge/eval_vq.yamlBenchmark evalVisual-quality judge prompt

Benchmark Keys

eval reads benchmark definitions from configs/datasets/bmk.json. Repository-visible keys include:

  • openedit
  • vc_reward
  • editscore_consistency
  • editscore_prompt_following
  • editreward_visual_quality

How to use this matrix

  1. Decide whether you need structured scores or pairwise winners.
  2. Choose the pipeline family.
  3. Start from the closest existing config instead of creating a blank YAML file.