|
1 | 1 | # Meebook to Readwise Converter |
2 | 2 |
|
3 | | -Convert your Meebook ereader highlights and notes to CSV format for seamless import into [Readwise](https://readwise.io). |
| 3 | +Convert your Meebook ereader highlights and notes (from the Haoqing reading app) to CSV format for seamless import into [Readwise](https://readwise.io). |
4 | 4 |
|
5 | 5 | ## 📖 What This Does |
6 | 6 |
|
@@ -97,106 +97,6 @@ The generated CSV files contain the following columns (compatible with Readwise) |
97 | 97 | - **Location**: Sequential number indicating highlight order |
98 | 98 | - **Date**: When the highlight was made (YYYY-MM-DD HH:MM:SS format) |
99 | 99 |
|
100 | | -## 🔧 What Gets Converted |
101 | | - |
102 | | -### From Your Meebook HTML Files: |
103 | | -- **Book Title & Author**: Automatically extracted from the file |
104 | | -- **All Highlights**: Every passage you highlighted while reading |
105 | | -- **Personal Notes**: Any notes you added to highlights (if different from default) |
106 | | -- **Reading Dates**: When each highlight was made |
107 | | -- **Chapter Information**: Organized by book sections |
108 | | -- **Location Data**: Sequential positioning within each book |
109 | | - |
110 | | -### Supported Meebook Export Format: |
111 | | -This tool is specifically designed for Meebook's HTML export format, which includes: |
112 | | -- Book metadata in `<h2>` tags (format: "Title - Author") |
113 | | -- Chapter/section headers with specific styling |
114 | | -- Timestamp information with orange left borders |
115 | | -- Highlight text in structured divs |
116 | | -- Note tables with remark data |
117 | | - |
118 | | -*Tested with: Meebook ereaders using Haoqing Reading app exports* |
119 | | - |
120 | | -## Examples |
121 | | - |
122 | | -### Batch Processing (Recommended) |
123 | | -```bash |
124 | | -python html_to_csv_converter.py --batch |
125 | | -``` |
126 | | - |
127 | | -This processes all HTML files in `html-files/` and creates: |
128 | | -- Individual CSV files for each book in `output/` |
129 | | -- `all_books_combined.csv` with all highlights in `output/` |
130 | | - |
131 | | -### Process specific book |
132 | | -```bash |
133 | | -python html_to_csv_converter.py "html-files/Why We Sleep - Matthew Walker.html" -o "output/sleep_book.csv" |
134 | | -``` |
135 | | - |
136 | | -## File Structure After Processing |
137 | | - |
138 | | -``` |
139 | | -Book-notes-html-to-csv/ |
140 | | -├── html-files/ |
141 | | -│ ├── book1.html |
142 | | -│ ├── book2.html |
143 | | -│ └── book3.html |
144 | | -└── output/ |
145 | | - ├── Book_1_Title.csv # Individual CSV files |
146 | | - ├── Book_2_Title.csv |
147 | | - ├── Book_3_Title.csv |
148 | | - └── all_books_combined.csv # Combined CSV file |
149 | | -``` |
150 | | - |
151 | | -## 🔍 Troubleshooting & FAQ |
152 | | - |
153 | | -### Common Issues |
154 | | - |
155 | | -**"No HTML files found in html-files folder"** |
156 | | -- Make sure you placed your Meebook HTML exports in the `html-files/` folder |
157 | | -- Check that files have `.html` extension |
158 | | - |
159 | | -**"No highlights found"** |
160 | | -- Verify the HTML files are from a Meebook/Haoqing Reading export |
161 | | -- Some books might not have highlights if none were made during reading |
162 | | - |
163 | | -**"Python not found" (Windows)** |
164 | | -- The batch file should handle this automatically |
165 | | -- If issues persist, install Python from [python.org](https://python.org) and try again |
166 | | - |
167 | | -**"Permission denied" or file errors** |
168 | | -- Make sure the `output/` folder isn't open in another program |
169 | | -- Check that you have write permissions in the directory |
170 | | - |
171 | | -### Getting HTML Files from Your Meebook |
172 | | -1. **Connect via USB**: Transfer files directly from your device |
173 | | -2. **Email Method**: Email the HTML files to yourself from the ereader |
174 | | -3. **Cloud Sync**: If your Meebook syncs to cloud storage, download from there |
175 | | -4. **Reading App**: Export from the companion reading app on your phone/tablet |
176 | | - |
177 | | -### Import Tips for Readwise |
178 | | -- Use the **combined CSV file** (`all_books_combined.csv`) for all books at once |
179 | | -- Or import **individual CSV files** if you prefer to process books separately |
180 | | -- Readwise will automatically deduplicate if you accidentally import the same highlights twice |
181 | | - |
182 | | -## 📚 Real-World Example |
183 | | - |
184 | | -Here's what happens when you use this tool: |
185 | | - |
186 | | -**Your Meebook HTML files** (in `html-files/`): |
187 | | -- `Why We Sleep - Matthew Walker_20251026_083943.html` |
188 | | -- `To Kill a Mockingbird - Harper Lee_20251026_083931.html` |
189 | | - |
190 | | -**After running the converter** (in `output/`): |
191 | | -- `Why_We_Sleep_-_Matthew_Walker.csv` (36 highlights) |
192 | | -- `To_Kill_a_Mockingbird_-_Harper_Lee.csv` (7 highlights) |
193 | | -- `all_books_combined.csv` (43 total highlights) |
194 | | - |
195 | | -**Import to Readwise**: |
196 | | -1. Visit [readwise.io/import_bulk](https://readwise.io/import_bulk) |
197 | | -2. Upload `all_books_combined.csv` |
198 | | -3. ✅ All 43 highlights now in Readwise with proper book info, dates, and notes! |
199 | | - |
200 | 100 | ## 🤝 Contributing & Support |
201 | 101 |
|
202 | 102 | ### Found a Bug? |
|
0 commit comments