pdfReader

How to use

  1. Create a folder named pdf in pdfReader/ and put your PDF documents in ./pdf/
  2. Run python pdfReader.py
  3. The PDF documents are converted into txt files in ./txt/
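
The steps above can be sketched roughly as follows. This is not the code in pdfReader.py, only a minimal illustration of the same folder-to-folder conversion. It assumes pdfminer's pdf2txt.py command-line tool is on your PATH; the function names plan_conversions and convert_all are hypothetical.

```python
import os
import subprocess


def plan_conversions(pdf_dir="pdf", txt_dir="txt"):
    """Return (pdf_path, txt_path) pairs for every .pdf file in pdf_dir."""
    pairs = []
    for name in sorted(os.listdir(pdf_dir)):
        base, ext = os.path.splitext(name)
        if ext.lower() == ".pdf":
            pairs.append((os.path.join(pdf_dir, name),
                          os.path.join(txt_dir, base + ".txt")))
    return pairs


def convert_all(pdf_dir="pdf", txt_dir="txt", run=subprocess.check_call):
    """Convert each PDF by calling pdf2txt.py (assumed to be on PATH)."""
    if not os.path.isdir(txt_dir):
        os.makedirs(txt_dir)
    pairs = plan_conversions(pdf_dir, txt_dir)
    for pdf_path, txt_path in pairs:
        # -o tells pdf2txt.py where to write the extracted text.
        run(["pdf2txt.py", "-o", txt_path, pdf_path])
    return pairs


if __name__ == "__main__":
    convert_all()
```

The run parameter exists only so the command can be swapped out, for example to print the commands instead of executing them.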

Dependency: pdfminer

How to Install

  • Install Python 2.6 or newer. (For Python 3 support, see pdfminer.six.)
  • Download the source code.
  • Unpack it.
  • Run setup.py:

$ python setup.py install

  • Do the following test:

$ pdf2txt.py samples/simple1.pdf

parseEEDoc

Parse English and Chinese Papers

How to use

Dependencies: jieba, wordcloud, and matplotlib

  • Webpage:
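
This README does not show how parseEEDoc combines its dependencies, so the following is only a guess at a typical workflow: segment the text with jieba, count word frequencies, and render them with wordcloud. The function names word_frequencies and save_wordcloud, the min_length filter, and the font_path argument are assumptions for illustration, not part of parseEEDoc.

```python
from collections import Counter


def word_frequencies(text, tokenize=None, min_length=2):
    """Count tokens in mixed English/Chinese text."""
    if tokenize is None:
        # Imported lazily so the counting logic can be tried without jieba.
        import jieba
        tokenize = jieba.lcut
    tokens = (tok.strip().lower() for tok in tokenize(text))
    # Drop whitespace, punctuation-sized tokens, and single characters.
    return Counter(tok for tok in tokens if len(tok) >= min_length)


def save_wordcloud(frequencies, out_path, font_path=None):
    """Render a word cloud image from a {word: count} mapping.

    For Chinese text, font_path should point to a font with CJK glyphs.
    """
    from wordcloud import WordCloud
    cloud = WordCloud(font_path=font_path, background_color="white")
    cloud.generate_from_frequencies(frequencies)
    cloud.to_file(out_path)
```

A txt file produced by pdfReader could be read, passed to word_frequencies, and the result handed to save_wordcloud. Older wordcloud releases may expect a list of (word, count) tuples rather than a mapping.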

Compatibility

  • Python 2.7.9
  • Python 3.4.3

Dependencies:

How to Install

How to Uninstall