Searching...
Flashcards in this deck (22)
  • What was the pet project discussed in the video?

    Analyzing fanfiction metadata and tag relationships.

    fanfiction metadata
  • What inspired the pet project?

    A discussion about weird fanfiction pairings with a roommate.

    inspiration discussion
  • What is the size of the metadata dump from Archive of Our Own?

    Over 400 megabytes.

    data archive
  • What type of data does the metadata dump contain?

    Creation date, language, completion status, word count, and tags.

    data metadata
  • How many fanfictions are in the metadata dump?

    More than 7 million.

    data fanfictions
  • What is the purpose of the tags in the metadata?

    To categorize and identify fanfictions.

    tags categorization
  • What can be merged in the tag data?

    Identical tags representing the same concept, like character names.

    tags merging
  • What tool can be used for analyzing the CSV files?

    A script to combine information and create character pairings.

    analysis csv
  • What does the script do with the tag IDs?

    It relates tag IDs to fanfictions and creates pairings.

    script pairings
  • What program is mentioned for creating visualizations?

    Gey.

    program visualization
  • What type of graph is implemented by GY?

    An undirected graph

    graph software
  • What is a limitation of GY when handling fanfiction data?

    It struggles with too many fanfiction entries (40.5 million)

    software data
  • How many individual fanfictions did the author limit their analysis to?

    250,000 individual fanfictions

    data fanfiction
  • What are some popular themes in the fanfiction data?

    • Harry Potter
    • Marvel
    • Percy Jackson
    • DC
    • Anime
    • K-pop
    • Gaming
    • Dream S&P
    fanfiction themes
  • What major shift is observed in fanfiction themes over time?

    Growth in Marvel, Supernatural, and Game of Thrones

    fanfiction themes trends
  • What criteria were used for character inclusion in the analysis?

    Characters appeared at least 10 times

    data characters
  • What was eliminated from the character graph to manage complexity?

    Characters that appeared less frequently

    data characters
  • What does a direct edge between two characters indicate?

    They appear in the same fanfiction

    graph relationships
  • What concept is similar to finding the distance between characters in fanfiction?

    The Wikipedia game

    game concept
  • What happens the further characters are from each other in fanfiction?

    The longer it would take to get from one character to another.

    fanfiction characters
  • What game is mentioned that is similar to the concept described?

    The Wikipedia game.

    games wikipedia
  • What is the analogy used to describe the distance between characters?

    A little bit like the Wikipedia game.

    analogy games