Skip to content
@kaustpradalab

PRADALab_KAUST

Provable Responsible AI and Data Analytics (PRADA) Lab, KAUST

Popular repositories Loading

  1. research-handboook research-handboook Public

    83 5

  2. Fraud-R1 Fraud-R1 Public

    [ACL 2025 Findings] Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

    Python 24 3

  3. LLM-Persona-Steering LLM-Persona-Steering Public

    Official code of "Exploring the Personality Traits of LLMs through Latent Features Steering"

    Python 16 2

  4. repeat-curse-llm repeat-curse-llm Public

    [ACL 2025 Findings] Understanding the Repeat Curse in Large Language Models from a Feature Perspective

    Python 16 2

  5. SAE-Factory SAE-Factory Public

    Training SAEs for your LLM, and visualize it in one place

    Python 7

  6. CoT-Dataset CoT-Dataset Public

    Jupyter Notebook 5 1

Repositories

Showing 10 of 21 repositories
  • LLM-sycophancy Public

    [AAAI'26 Main🎉] Official code of "When Truth Is Overridden: Uncovering the Internal Origins of Sycophancy in Large Language Models"

    kaustpradalab/LLM-sycophancy’s past year of commit activity
    Python 5 0 0 0 Updated Nov 11, 2025
  • flashdp Public
    kaustpradalab/flashdp’s past year of commit activity
    Python 4 Apache-2.0 3 0 0 Updated Jul 1, 2025
  • Fraud-R1 Public

    [ACL 2025 Findings] Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

    kaustpradalab/Fraud-R1’s past year of commit activity
    Python 24 3 1 0 Updated Jun 29, 2025
  • repeat-curse-llm Public

    [ACL 2025 Findings] Understanding the Repeat Curse in Large Language Models from a Feature Perspective

    kaustpradalab/repeat-curse-llm’s past year of commit activity
    Python 16 2 0 0 Updated Jun 13, 2025
  • ECBM Public
    kaustpradalab/ECBM’s past year of commit activity
    Python 2 0 1 0 Updated May 27, 2025
  • zo2 Public Forked from liangyuwang/zo2

    ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

    kaustpradalab/zo2’s past year of commit activity
    Python 3 Apache-2.0 17 0 0 Updated Apr 13, 2025
  • draft Public Forked from kaustpradalab/zo2

    Privately Fine-Tuning Extremely Large Language Models with Zeroth-Order Offloading

    kaustpradalab/draft’s past year of commit activity
    Python 0 Apache-2.0 17 0 0 Updated Mar 10, 2025
  • vanilla-RLAIF-pipeline Public Forked from mengdi-li/vanilla-RLAIF-pipeline

    An implementation of a vanilla RLAIF pipeline, utilizing GPT-2-Large for the summarization task with the TL;DR dataset.

    kaustpradalab/vanilla-RLAIF-pipeline’s past year of commit activity
    Python 1 1 0 0 Updated Feb 5, 2025
  • LLM-Persona-Steering Public

    Official code of "Exploring the Personality Traits of LLMs through Latent Features Steering"

    kaustpradalab/LLM-Persona-Steering’s past year of commit activity
    Python 16 2 0 0 Updated Jan 30, 2025
  • SAE-Factory Public

    Training SAEs for your LLM, and visualize it in one place

    kaustpradalab/SAE-Factory’s past year of commit activity
    Python 7 Apache-2.0 0 0 0 Updated Nov 4, 2024

Most used topics

Loading…