Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a dataset 6 days ago
entfane/violent_eval published a dataset 7 days ago
entfane/violent_eval updated a model 7 days ago
entfane/gpt2_constitutional_classifier_violence