Abhishek Ahuja – Medium

Abhishek Ahuja

A visual summary of Learning to Summarize with Human Feedback

An interesting paper on how to train large-scale human-in-the-loop Language Models with focus on preference alignment. The architecture is…

Oct 19, 2020

A visual summary of Learning to Summarize with Human Feedback

Oct 19, 2020

Abhishek Ahuja

Abhishek Ahuja

MSCS@UMass Amherst.

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech