Language Model Contains Personality Subnetworks

(arxiv.org)

57 points | by PaulHoule 3 days ago ago

34 comments