A concept of providing boundaries for a machine learning model to stay away from topics, categories, etc. of given outputs it has been promoted to produce. Examples could be the types of language it has been prompted to say or types of questions it has been asked to answer.