mynameisfiber/nanogenmo2015

语言: Python

git: https://github.com/mynameisfiber/nanogenmo2015

README.md (中文)

什么东西nanogemno

首先,我们必须解析一些古腾堡数据!因为我找不到 任何符合条件的东西,我都要解析HTM文件(因为html 必须比原始文本更容易使用)。

下一个?世界!

数据

下载地址:

  • HTTP://mirrors.抛光蜡封.org/Gutenberg-ISO/苹果是非常大-032007.zip
  • HTTP://mirrors.抛光蜡封.org/Gutenberg-ISO/PG DVD072006.ISO

并从他们孤独的ISO容器中提取出来。

本文使用googletrans自动翻译,仅供参考, 原文来自github.com

en_README.md

something something nanogemno

To start off, first we must parse some Gutenberg data! Since I couldn't find
anything that fit the bill, I'm going to parse through the HTM files (since html
is must easier to work with than raw text).

Next? THE WORLD!

data

Downloaded from:

and extracted out of their lonely ISO containers.