| Описание датасета
|
1000 самых популярных книг проекта Гутенберг - имя автора, дата публикации, дата рождения, смерти средняя длина предложений, сложность чтения и т.д.
|
| Описание полей
|
- "bibliography.congress classifications",
- "bibliography.languages",
- "bibliography.subjects",
- "bibliography.title",
- "bibliography.type",
- "metadata.downloads",
- "metadata.id",
- "metadata.rank",
- "metadata.url",
- "bibliography.author.birth",
- "bibliography.author.death",
- "bibliography.author.name",
- "bibliography.publication.day",
- "bibliography.publication.full",
- "bibliography.publication.month",
- "bibliography.publication.month name",
- "bibliography.publication.year",
- "metadata.formats.total",
- "metadata.formats.types",
- "metrics.difficulty.automated readability index",
- "metrics.difficulty.coleman liau index",
- "metrics.difficulty.dale chall readability score",
- "metrics.difficulty.difficult words",
- "metrics.difficulty.flesch kincaid grade",
- "metrics.difficulty.flesch reading ease",
- "metrics.difficulty.gunning fog",
- "metrics.difficulty.linsear write formula",
- "metrics.difficulty.smog index",
- "metrics.sentiments.polarity",
- "metrics.sentiments.subjectivity",
- "metrics.statistics.average letter per word",
- "metrics.statistics.average sentence length",
- "metrics.statistics.average sentence per word",
- "metrics.statistics.characters",
- "metrics.statistics.polysyllables",
- "metrics.statistics.sentences",
- "metrics.statistics.syllables",
- "metrics.statistics.words"
|
| Форматы данных
|
CSV, JSON
|
| Область знаний
|
Психология, Социология
|
| Веб-сайт - ссылка на датасет
|
https://corgis-edu.github.io/corgis/datasets/csv/classics/classics.csv
|
| Примеры использования датасета
|
|
| Год создания датасета
|
|
This dataset is a collection of the top 1000 most popular books on Project Gutenberg, as determined by downloads. Each book has information about its authorship, publication date, congressional classication, and a few other fields. It also has some simple, computed statistics based on common metrics such as sentiment analysis, Flesch Kincaid Reading level, and average sentence length.
Исходный файл
- https://corgis-edu.github.io/corgis
- https://corgis-edu.github.io/corgis/datasets/csv/classics/
Описание столбцов в таблице:
- bibliography.title
- bibliography.author.name
Фильтр в Snap!
Получаем данные
Таблица книг отсортированных по параметрам Rank, Readability, Readability, Comprehension_Difficulty, Polarity, Subjectivity
Таблица