Have students create a Python program that implements an automated authorship detection algorithm, which gives them an opportunity to practice working with strings, lists, and loops.

  • In automated authorship detection, a computer program analyzes a large collection of texts, one of which has an unknown author.
    • The program uses statistics, such as average words per sentence, to measure how similar two texts are and so identify the most likely author of the unattributed text.
  • For the full assignment, visit Stanford’s Nifty Assignments page.
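
As a starting point for students, the words-per-sentence idea can be sketched as below. This is a minimal illustration and not the assignment's own solution: the function names (`split_sentences`, `average_words_per_sentence`, `closest_author`), the naive sentence-splitting rule, and the use of a single statistic are all assumptions made for this example. The full assignment may define its statistics and comparison differently.

```python
def split_sentences(text):
    """Split text into sentences on '.', '!' and '?' (a deliberately naive rule)."""
    sentences = []
    current = ""
    for ch in text:
        if ch in ".!?":
            # End of a sentence: keep it only if it contains something.
            if current.strip():
                sentences.append(current.strip())
            current = ""
        else:
            current += ch
    # Keep any trailing text that had no closing punctuation.
    if current.strip():
        sentences.append(current.strip())
    return sentences


def average_words_per_sentence(text):
    """Return the mean number of whitespace-separated words per sentence."""
    sentences = split_sentences(text)
    if not sentences:
        return 0.0
    total_words = 0
    for sentence in sentences:
        total_words += len(sentence.split())
    return total_words / len(sentences)


def closest_author(mystery_text, known_texts):
    """Return the author whose text is closest to mystery_text.

    known_texts is a list of (author, text) pairs. Closeness here is just
    the absolute difference in average words per sentence (an assumption
    for this sketch, using one statistic only).
    """
    mystery_score = average_words_per_sentence(mystery_text)
    best_author = None
    best_gap = None
    for author, text in known_texts:
        gap = abs(average_words_per_sentence(text) - mystery_score)
        if best_gap is None or gap < best_gap:
            best_author = author
            best_gap = gap
    return best_author


# Hypothetical sample data, invented for illustration.
known = [
    ("Author A", "Short one. Tiny bit. Done now."),
    ("Author B", "This sentence runs on for quite a few more words than "
                 "the others do. So does this second one, which also "
                 "keeps going for a while."),
]
mystery = ("Here is a longer sentence with a fair number of words in it. "
           "And here is another that is similarly long and winding.")
print(closest_author(mystery, known))  # prints "Author B"
```

A single statistic is rarely enough to tell authors apart, so a natural extension is to have students add more measurements (for example, average word length) and combine the differences into one score.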