Publication:

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Date

 
dc.contributor.authorD'Oosterlinck, Karel
dc.contributor.authorXu, Winnie
dc.contributor.authorDevelder, Chris
dc.contributor.authorDemeester, Thomas
dc.contributor.authorSingh, Amanpreet
dc.contributor.authorPotts, Christopher
dc.contributor.authorKiela, Douwe
dc.contributor.authorMehri, Shikib
dc.contributor.imecauthorD'Oosterlinck, Karel
dc.contributor.imecauthorDevelder, Chris
dc.contributor.imecauthorDemeester, Thomas
dc.contributor.orcidimecD'Oosterlinck, Karel::0000-0003-1695-1014
dc.contributor.orcidimecDevelder, Chris::0000-0003-2707-4176
dc.contributor.orcidimecDemeester, Thomas::0000-0002-9901-5768
dc.date.accessioned2025-05-19T10:36:19Z
dc.date.available2025-05-17T05:45:22Z
dc.date.available2025-05-19T10:36:19Z
dc.date.issued2025
dc.description.wosFundingTextWe thank Kawin Ethayarajh, Eugen Hotaj, and Nathan Lambert for their feedback. We thank Stas Bekman for his help and support. K. D. gratefully acknowledges funding from the FWO Fundamental Research PhD Fellowship (11632223N). We also thank our anonymous reviewers for their valuable comments, which helped improve the clarity and quality of this work.
dc.identifier.doi10.1162/tacl_a_00748
dc.identifier.issn2307-387X
dc.identifier.urihttps://imec-publications.be/handle/20.500.12860/45682
dc.publisherMIT PRESS
dc.source.beginpage442
dc.source.endpage460
dc.source.issue/
dc.source.journalTRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS
dc.source.numberofpages19
dc.source.volume13
dc.title

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

dc.typeJournal article
dspace.entity.typePublication
Files

Original bundle

Name:
8806.pdf
Size:
873.91 KB
Format:
Adobe Portable Document Format
Description:
Publication available in collections: