Em Dash Analysis: Detecting AI-Generated Content on Reddit
- The project measures em dash frequency in Reddit comments across tech-related subreddits to flag potentially AI-generated content. Its working hypothesis is that AI models exhibit punctuation patterns distinct from those of human users.
- The repository `v4nn4/em-dash-conspiracy` contains the code used to perform this analysis.
- _Limitations:_ Em dash frequency has not been explicitly validated as a standalone indicator of AI-generated text, and it may be confounded by factors such as subreddit-specific writing styles or individual user preferences.
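
The frequency measurement described above can be illustrated with a minimal sketch. This is not the code from `v4nn4/em-dash-conspiracy`; the function name `em_dash_rate`, the `(subreddit, body)` input shape, and the normalization to em dashes per 1,000 characters are all assumptions made for illustration.

```python
from collections import defaultdict

# U+2014 is the Unicode em dash; written as an escape so it is unambiguous.
EM_DASH = "\u2014"


def em_dash_rate(comments):
    """Return em dashes per 1,000 characters for each subreddit.

    `comments` is an iterable of (subreddit, body) pairs. This input
    shape is a hypothetical choice, not the repository's data format.
    """
    dashes = defaultdict(int)
    chars = defaultdict(int)
    for subreddit, body in comments:
        dashes[subreddit] += body.count(EM_DASH)
        chars[subreddit] += len(body)
    # Skip subreddits with no text to avoid dividing by zero.
    return {
        sub: 1000 * dashes[sub] / chars[sub]
        for sub in chars
        if chars[sub] > 0
    }
```

Normalizing by text length keeps subreddits with long comments from looking dash-heavy simply because they contain more text. It does not address the confounders noted under Limitations: a per-subreddit rate still mixes community writing style with individual habits.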
Source: