What was the pet project discussed in the video?
Analyzing fanfiction metadata and tag relationships.
What inspired the pet project?
A discussion about weird fanfiction pairings with a roommate.
What is the size of the metadata dump from Archive of Our Own?
Over 400 megabytes.
What type of data does the metadata dump contain?
Creation date, language, completion status, word count, and tags.
How many fanfictions are in the metadata dump?
More than 7 million.
What is the purpose of the tags in the metadata?
To categorize and identify fanfictions.
What can be merged in the tag data?
Identical tags representing the same concept, like character names.
What tool can be used for analyzing the CSV files?
A script to combine information and create character pairings.
What does the script do with the tag IDs?
It relates tag IDs to fanfictions and creates pairings.
What program is mentioned for creating visualizations?
Gey.
What type of graph is implemented by GY?
An undirected graph
What is a limitation of GY when handling fanfiction data?
It struggles with too many fanfiction entries (40.5 million)
How many individual fanfictions did the author limit their analysis to?
250,000 individual fanfictions
What are some popular themes in the fanfiction data?
What major shift is observed in fanfiction themes over time?
Growth in Marvel, Supernatural, and Game of Thrones
What criteria were used for character inclusion in the analysis?
Characters appeared at least 10 times
What was eliminated from the character graph to manage complexity?
Characters that appeared less frequently
What does a direct edge between two characters indicate?
They appear in the same fanfiction
What concept is similar to finding the distance between characters in fanfiction?
The Wikipedia game
What happens the further characters are from each other in fanfiction?
The longer it would take to get from one character to another.
What game is mentioned that is similar to the concept described?
The Wikipedia game.
What is the analogy used to describe the distance between characters?
A little bit like the Wikipedia game.
What was the pet project discussed in the video?
Analyzing fanfiction metadata and tag relationships.
What type of data does the metadata dump contain?
Creation date, language, completion status, word count, and tags.
What can be merged in the tag data?
Identical tags representing the same concept, like character names.
What tool can be used for analyzing the CSV files?
A script to combine information and create character pairings.
What is a limitation of GY when handling fanfiction data?
It struggles with too many fanfiction entries (40.5 million)
How many individual fanfictions did the author limit their analysis to?
250,000 individual fanfictions
What are some popular themes in the fanfiction data?
What major shift is observed in fanfiction themes over time?
Growth in Marvel, Supernatural, and Game of Thrones
What criteria were used for character inclusion in the analysis?
Characters appeared at least 10 times
What was eliminated from the character graph to manage complexity?
Characters that appeared less frequently
What concept is similar to finding the distance between characters in fanfiction?
The Wikipedia game
What happens the further characters are from each other in fanfiction?
The longer it would take to get from one character to another.
What is the analogy used to describe the distance between characters?
A little bit like the Wikipedia game.
Are you sure you want to delete 0 flashcard(s)? This cannot be undone.
Select tags to remove from 0 selected flashcard(s):
Loading tags...