Em Dash Analysis: Detecting AI-Generated Content on Reddit

  • The project measures how often em dashes appear in Reddit comments across tech-related subreddits, on the hypothesis that AI models punctuate differently from human users and that em dash frequency can therefore flag potentially AI-generated content.
  • The repository `v4nn4/em-dash-conspiracy` contains the code for this analysis.
  • _Limitations:_ Em dash frequency has not been explicitly validated as a standalone indicator of AI-generated text, and it may be confounded by subreddit-specific writing styles or individual user preferences.
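
The measurement described above can be sketched in a few lines of Python. This is a minimal illustration, not the code from `v4nn4/em-dash-conspiracy`: the function names `em_dash_rate` and `rate_by_subreddit`, the `(subreddit, body)` input shape, and the per-1,000-characters normalization are all assumptions made for the example.

```python
from collections import defaultdict

# U+2014 EM DASH, written as an escape so the source stays ASCII-only.
EM_DASH = "\u2014"


def em_dash_rate(text: str) -> float:
    """Return em dashes per 1,000 characters (0.0 for empty text)."""
    if not text:
        return 0.0
    return 1000 * text.count(EM_DASH) / len(text)


def rate_by_subreddit(comments):
    """Pool comments per subreddit and return the em dash rate for each.

    `comments` is an iterable of (subreddit, body) pairs; this input
    shape is hypothetical and chosen only for illustration.
    """
    dashes = defaultdict(int)
    chars = defaultdict(int)
    for subreddit, body in comments:
        dashes[subreddit] += body.count(EM_DASH)
        chars[subreddit] += len(body)
    # Skip subreddits whose comments are all empty to avoid dividing by zero.
    return {s: 1000 * dashes[s] / chars[s] for s in chars if chars[s] > 0}
```

Pooling characters per subreddit before dividing keeps very short comments from dominating the rate. Comparing rates between subreddits, or over time within one subreddit, is one way to look for the confounders noted in the limitations, though a high rate alone does not establish that a comment is AI-generated.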