Project Details

Project ID BITS-SRIP/6CCCA0/2026
Project Title Method and Model Development for Distinguishing Human-Written vs. LLM-Generated Content
Project Description The rapid adoption of large language models (LLMs) has made it increasingly difficult to distinguish human-written from AI-generated content in academic writing, news articles, social media, and online platforms. This raises critical concerns about academic integrity, misinformation, content authenticity, and trust in digital communication. This project aims to develop robust methods and machine learning models that automatically detect whether a given text was written by a human or generated by an AI system. The project will explore linguistic, stylistic, statistical, and representation-level cues that differentiate human and AI text, and will design hybrid detection models combining classical NLP features with modern deep learning approaches. The objectives are:
Identify linguistic and statistical patterns that distinguish human vs AI text
Build machine learning and deep learning models for detection
Study robustness across domains, topics, and AI models
Analyze failure cases and adversarial scenarios
Explore explainability and confidence calibration of detectors
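As a purely illustrative sketch of the kind of linguistic and statistical patterns the first objective refers to (the specific features and their interpretation below are assumptions for illustration, not the project's chosen method), a minimal stylometric feature extractor in Python might look like:

```python
import re
from collections import Counter

def stylometric_features(text: str) -> dict:
    """Compute simple surface-level cues often examined when comparing
    human and AI text (illustrative, not exhaustive)."""
    words = re.findall(r"[A-Za-z']+", text.lower())
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    counts = Counter(words)
    n_words = max(len(words), 1)
    return {
        # Lexical diversity: unique words / total words
        "type_token_ratio": len(counts) / n_words,
        # Average sentence length in words (a burstiness proxy)
        "avg_sentence_len": n_words / max(len(sentences), 1),
        # Fraction of words used exactly once (hapax legomena),
        # a classic stylometric signal
        "hapax_ratio": sum(1 for c in counts.values() if c == 1) / n_words,
    }
```

Such features could then feed a baseline classifier, while the deep learning models in the later objectives would learn representation-level cues directly from text.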
OUTCOMES:
A curated benchmark dataset of human-written and AI-generated texts across multiple domains (essays, news, answers, reports, etc.)
A set of baseline and advanced detection models
A systematic evaluation of cross-model generalization (GPT, Claude, etc.) and cross-domain robustness
Project Discipline Computer Science, AI and Data Science
Faculty Name Vinti Agarwal
Department Department of Computer Science & Information Systems