Skip to content

Commit 88225cc

Browse files
modified .travis and README.md
1 parent 8d0c948 commit 88225cc

2 files changed

Lines changed: 10 additions & 9 deletions

File tree

.travis.yml

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,11 @@ language: python
22
python:
33
- '3.5'
44
before_install:
5-
- wget 'https://s3.amazonaws.com/deepcompute-public-data/wordvecspace/small_test_data.tgz'
6-
&& tar xvzf small_test_data.tgz
7-
- export WORDVECSPACE_DATADIR='small_test_data'
5+
- wget 'https://s3.amazonaws.com/deepcompute-public-data/wordvecspace/test_data-0_5_4.tgz'
6+
&& tar xvzf test_data-0_5_4.tgz
7+
- export WORDVECSPACE_DATADIR='test_data-0_5_4.tgz'
88
install:
9+
- sudo apt update
910
- pip install .[service]
1011
- sudo apt install libopenblas-base
1112
script:
@@ -15,8 +16,8 @@ deploy:
1516
skip_cleanup: true
1617
api-key:
1718
secure: LmVvlW+FdYNIDlinjJ4sieONrcx1jaw18J7/mpHBD9ppIWZ+TB6H/iNqkqkh4WvULZttJrTHRYE6rQHXww7KK2UMrjVNE/TVUPaLFDeRRFvLDinAbqJkn+QJia0TuRa/26Bg9cDcvNYTghy7s37xpK2bJTEMF/eCM9b9RHYXilESYy8Z4l8IkFn5vnaDDfT5iV8xjuuOE4lsf4KC3L0xXIkYnKC/LbDVDj3B9h52TpsteL6cZtn/ExAThor5SrVymW7oMR1qrPQv8btNAdxymqJvEbjaP5RUuX7ehihev0Yge47A2X9gvxDRv+a6wM0HOvT4aGsMwCWo++fb0taWH7HUXFxSvkzKhsl74kDMmnE0WarcI/8L/3Q/zRhW1a2vAtj3O0FDHtzS/OK/k3TDk6Fh/LOvk2mTuGD3L34YxJrXxDxnt4tK2ubde8cGeA7pI5jRLNTNQXUip6Dxhr/5ZnMmG2nHI6ujjmDnucE+CHBtUmS1wjBn6ootE4pdoyti0aaA9OrVoGrf39pK7FAG38KJghqn8I3YCLoeapWjI4/DI0WIfq2Vl+v6yQar3Dn9lBLpWFLrjUmZnAx2F1e0P2y0VUg9hl0bINzIIrm2mHw4Zsl2GlMVSR033cwvcbdyeNxKMAfSV3EZBDpNuI6nlkkUZG1O72N/WV+kFRtSdQA=
18-
name: wordvecspace-0.5.3
19-
tag_name: 0.5.3
19+
name: wordvecspace-0.5.4
20+
tag_name: 0.5.4
2021
on:
2122
repo: deep-compute/wordvecspace
2223
# pypitest

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ A high performance pure python module that helps in loading and performing opera
44
This module has ability to the load data into memory using `WordVecSpaceMem` and it can also support performing operations on the data which is on the disk using `WordVecSpaceAnnoy` and `WordVecSpaceDisk`.
55

66
## Installation
7-
> Prerequisites: Python3.5.2
7+
> Prerequisites: >=Python3.5.2
88
99
```bash
1010

@@ -23,7 +23,7 @@ word vector space data. Here are two ways to get that.
2323

2424
```bash
2525
$ wget https://s3.amazonaws.com/deepcompute-public-data/wordvecspace/test_data-0_5_4.tgz
26-
$ tar test_data-0_5_4.tgz
26+
$ tar zxvf test_data-0_5_4.tgz
2727
```
2828

2929
> NOTE: We got this data by downloading the `text8` corpus
@@ -42,7 +42,7 @@ $ git clone https://github.com/tmikolov/word2vec.git
4242

4343
# 1. Navigate to the folder word2vec
4444
# 2. open demo-word.sh for editing
45-
# 3. Edit the command "time ./word2vec -train text8 -output vectors.bin -cbow 1 -size 200 -window 8 -negative 25 -hs 0 -sample 1e-4 -threads 20 -binary 1 -iter 15" ----to----> "time ./word2vec -train text8 -output vectors.bin -cbow 1 -size 5 -window 8 -negative 25 -hs 0 -sample 1e-4 -threads 20 -binary 1 -save-vocab vocab.txt -iter 15" to get vocab.txt file also as output.
45+
# 3. Edit the command "time ./word2vec -train text8 -output vectors.bin -cbow 1 -size 200 -window 8 -negative 25 -hs 0 -sample 1e-4 -threads 20 -binary 1 -iter 15" ----to----> "time ./word2vec -train text8 -output vectors.bin -cbow 1 -size 5 -window 8 -negative 25 -hs 0 -sample 1e-4 -threads 20 -binary 1 -save-vocab vocab.txt -iter 15" to get vocab.txt file also as output.
4646
# 4. Run demo-word.sh
4747

4848
$ chmod +x demo-word.sh
@@ -97,7 +97,7 @@ $ wordvecspace convert /home/user/bindata /home/user/output_dir
9797

9898
`WordVecSpaceMem` and `WordVecSpaceDisk` is a bruteforce algorithm which compares given word with all the words in the vector space
9999

100-
`WordVecSpaceAnnoy` takes wordvecspace output_dir as input and creates annoy indexes in another file (index file). Using this file `annoy` gives approximate results quickly. For better understanding of `Annoy` please go through this [link](https://github.com/spotify/annoy)
100+
`WordVecSpaceAnnoy` takes wordvecspace output_dir as input and creates annoy indexes in another file (index file). Using this file `annoy` gives approximate results quickly. For better understanding of `Annoy` please go through this [link](https://github.com/spotify/annoy)
101101

102102
As we have seen how to import `WordVecSpaceDisk` above, let us look at `WordVecSpaceAnnoy` and `WordVecSpaceMem`
103103

0 commit comments

Comments
 (0)