Today, I'm reading philosophy. A classic issue is "the problem of universals" ( summary : what makes the word chair, apply to this thing, and not that thing? ).
So, it reminds me that this, "leaky abstraction", is at the heart of problems around "governance failure" in current LLMs, and with people in general (e.g., "never do X" or "always do Y" ).
It's nice nowadays, to be able to take ancient fuzzy concepts and apply mechanistic interpretation to them!
No comments :
Post a Comment