| {{Dataset
| |
| |Description=The 1000 most popular Project Gutenberg books: author name, publication date, the author's birth and death dates, average sentence length, reading difficulty, etc.
| |
| * http://digida.mgpu.ru/images/thumb/b/b0/Book_RG01.png/120px-Book_RG01.png
| |
| |KeyDescripions=# "bibliography.congress classifications",
| |
| # "bibliography.languages",
| |
| # "bibliography.subjects",
| |
| # "bibliography.title",
| |
| # "bibliography.type",
| |
| # "metadata.downloads",
| |
| # "metadata.id",
| |
| # "metadata.rank",
| |
| # "metadata.url",
| |
| # "bibliography.author.birth",
| |
| # "bibliography.author.death",
| |
| # "bibliography.author.name",
| |
| # "bibliography.publication.day",
| |
| # "bibliography.publication.full",
| |
| # "bibliography.publication.month",
| |
| # "bibliography.publication.month name",
| |
| # "bibliography.publication.year",
| |
| # "metadata.formats.total",
| |
| # "metadata.formats.types",
| |
| # "metrics.difficulty.automated readability index",
| |
| # "metrics.difficulty.coleman liau index",
| |
| # "metrics.difficulty.dale chall readability score",
| |
| # "metrics.difficulty.difficult words",
| |
| # "metrics.difficulty.flesch kincaid grade",
| |
| # "metrics.difficulty.flesch reading ease",
| |
| # "metrics.difficulty.gunning fog",
| |
| # "metrics.difficulty.linsear write formula",
| |
| # "metrics.difficulty.smog index",
| |
| # "metrics.sentiments.polarity",
| |
| # "metrics.sentiments.subjectivity",
| |
| # "metrics.statistics.average letter per word",
| |
| # "metrics.statistics.average sentence length",
| |
| # "metrics.statistics.average sentence per word",
| |
| # "metrics.statistics.characters",
| |
| # "metrics.statistics.polysyllables",
| |
| # "metrics.statistics.sentences",
| |
| # "metrics.statistics.syllables",
| |
| # "metrics.statistics.words"
| |
| |FileFormat=CSV, JSON
| |
| |Field_of_knowledge=Psychology, Sociology
| |
| |Website=https://corgis-edu.github.io/corgis/datasets/csv/classics/classics.csv
| |
| }}
| |
| This dataset is a collection of the 1000 most popular books on Project Gutenberg, ranked by number of downloads. Each book has information about its authorship, publication date, Library of Congress classification, and a few other fields. It also includes simple computed statistics based on common metrics such as sentiment analysis, the Flesch-Kincaid reading level, and average sentence length.
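As an illustration of how readability columns such as `metrics.difficulty.automated readability index` and `metrics.difficulty.flesch kincaid grade` relate to the raw counts in `metrics.statistics.*`, here is a minimal Python sketch using the standard published formulas. It is an assumption that the dataset's values were computed this way; the tool used by its authors may count characters, syllables, or sentences differently, so the results need not match the CSV exactly. The function names and the sample counts are hypothetical.

```python
def average_sentence_length(words, sentences):
    """Average number of words per sentence."""
    return words / sentences


def automated_readability_index(characters, words, sentences):
    """Standard ARI formula from character, word, and sentence counts."""
    return 4.71 * (characters / words) + 0.5 * (words / sentences) - 21.43


def flesch_kincaid_grade(words, sentences, syllables):
    """Standard Flesch-Kincaid grade level formula."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# Hypothetical counts for a short text, not taken from the dataset.
counts = {"characters": 450, "words": 100, "sentences": 5, "syllables": 150}

ari = automated_readability_index(
    counts["characters"], counts["words"], counts["sentences"]
)
fk = flesch_kincaid_grade(
    counts["words"], counts["sentences"], counts["syllables"]
)
```

Both scores approximate a school grade level, so higher values mean harder text.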
| |
|
| |
|
| [[File:Book RG01.png|800px]]
| |
|
| |
| == Source file ==
| |
| ; https://corgis-edu.github.io/corgis
| |
| : https://corgis-edu.github.io/corgis/datasets/csv/classics/
| |
|
| |
| Description of the table columns:
| |
| * bibliography.title
| |
| * bibliography.author.name
| |
|
| |
| == Filter in Snap! ==
| |
|
| |
| [[File:Data Exteranal Book.png]]
| |
|
| |
| == Getting the data ==
| |
| * get_web_data
| |
|
| |
| {{#get_web_data:url=https://corgis-edu.github.io/corgis/datasets/csv/classics/classics.csv
| |
| |format=csv with header
| |
| |data=Title=bibliography.title,Author=bibliography.author.name, Rank=metadata.rank,Readability=metrics.difficulty.automated readability index, Understandability=metrics.difficulty.coleman liau index, Comprehension_Difficulty=metrics.difficulty.dale chall readability score, Polarity=metrics.sentiments.polarity, Subjectivity=metrics.sentiments.subjectivity
| |
| }}
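For readers who want to reproduce the same column selection outside the wiki, the mapping in the `data=` parameter above can be sketched in Python roughly as follows. This is only an illustration under stated assumptions: `select_columns` and `COLUMN_MAP` are hypothetical names, the sample row contains placeholder values rather than real dataset records, and fetching the CSV from the URL is left out so that the sketch runs offline.

```python
import csv
import io

# Local name -> CSV header, mirroring the data= mapping of #get_web_data.
COLUMN_MAP = {
    "Title": "bibliography.title",
    "Author": "bibliography.author.name",
    "Rank": "metadata.rank",
    "Readability": "metrics.difficulty.automated readability index",
    "Understandability": "metrics.difficulty.coleman liau index",
    "Comprehension_Difficulty": "metrics.difficulty.dale chall readability score",
    "Polarity": "metrics.sentiments.polarity",
    "Subjectivity": "metrics.sentiments.subjectivity",
}


def select_columns(csv_text, column_map=COLUMN_MAP):
    """Parse CSV text with a header row and keep only the mapped columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {local: row[remote] for local, remote in column_map.items()}
        for row in reader
    ]


# Placeholder data with the same headers as the mapping (not real records).
SAMPLE_CSV = (
    ",".join(COLUMN_MAP.values())
    + "\n"
    + "Example Title,Example Author,1,10.0,9.0,6.0,0.1,0.5\n"
)

rows = select_columns(SAMPLE_CSV)
```

Each element of `rows` is a dictionary keyed by the local names (`Title`, `Author`, `Rank`, and so on), which is the same shape that the table template below consumes.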
| |
|
| |
| == Table of books sortable by Rank, Readability, Comprehension_Difficulty, Polarity, Subjectivity ==
| |
| ; Important incantations for working with data
| |
| <nowiki>
| |
| {{#for_external_table:
| |
| </nowiki>
| |
|
| |
| ===== Table =====
| |
| {| class="wikitable sortable"
| |
| ! Title
| |
| ! Author
| |
| ! Rank
| |
| ! Readability
| |
| ! Comprehension_Difficulty
| |
| ! Polarity
| |
| ! Subjectivity {{#for_external_table:<nowiki/>
| |
| {{!}}-
| |
| {{!}} {{{Title}}}
| |
| {{!}} {{{Author}}}
| |
| {{!}}{{{Rank}}}
| |
| {{!}} {{{Readability}}}
| |
| {{!}} {{{Comprehension_Difficulty}}}
| |
| {{!}} {{{Polarity}}}
| |
| {{!}} {{{Subjectivity}}} }}
| |
| |}
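The loop above emits one table row per retrieved record. The same idea can be sketched in Python as a function that turns a list of row dictionaries into wikitable markup. This is a hedged illustration, not how the External Data extension is implemented; `render_wikitable` is a hypothetical name and the example row is placeholder data.

```python
def render_wikitable(rows, columns):
    """Render row dictionaries as a sortable MediaWiki table."""
    lines = ['{| class="wikitable sortable"']
    # Header cells, one per column.
    lines += ["! " + column for column in columns]
    for row in rows:
        lines.append("|-")  # start of a new table row
        lines += ["| " + str(row[column]) for column in columns]
    lines.append("|}")
    return "\n".join(lines)


# Placeholder row, not taken from the dataset.
markup = render_wikitable([{"Title": "A", "Rank": "1"}], ["Title", "Rank"])
```

Inside a parser function the row and cell separators have to be written as `{{!}}-` and `{{!}}`, as in the wikitext above, because a bare `|` would be read as a parameter separator.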
| |
|
| |
|
| |
|
| |
|
| |
| [[Category:Dataset]]
| |