Update: It seems this only happens in certain parts of the PDF. In other parts, the hyphen is copied as I was hoping it would be. Now I'm totally confused about why this happens.
Text in a PDF is stored as "character X at position Y". It's not really a word or a sentence the way you might think of it. Turning copied PDF text into words and sentences is therefore a bunch of heuristics: "Is this letter close to that letter? If yes, they're probably part of the same word", and so on. The result depends on the specific PDF, and without access to that specific PDF I can't say anything beyond the generic description above.
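To make the "is this letter close to that letter?" idea concrete, here is a minimal Python sketch of that kind of heuristic. It is not SumatraPDF's actual code. The `Glyph` type, the `group_into_words` function, and the `gap_ratio` threshold are all made up for illustration, and it assumes the glyphs sit on a single line.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    """Hypothetical positioned character, roughly what a PDF stores."""
    char: str
    x: float      # left edge of the glyph
    width: float  # advance width of the glyph


def group_into_words(glyphs, gap_ratio=0.3):
    """Group glyphs on one line into words using horizontal gaps.

    A new word starts when the gap to the previous glyph is larger than
    gap_ratio times the previous glyph's width (an arbitrary threshold).
    """
    words = []
    current = ""
    prev = None
    for g in sorted(glyphs, key=lambda g: g.x):
        if prev is not None:
            gap = g.x - (prev.x + prev.width)
            if gap > gap_ratio * prev.width:
                words.append(current)
                current = ""
        current += g.char
        prev = g
    if current:
        words.append(current)
    return words


line = [
    Glyph("h", 0, 5), Glyph("i", 5, 5),    # touching: same word
    Glyph("y", 14, 5), Glyph("o", 19, 5),  # gap of 4 before "y": new word
]
print(group_into_words(line))
```

A threshold like this behaves differently depending on font, spacing, and justification, which is one plausible way the same document can extract cleanly in one place and oddly in another.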
It seems that when I copy a paragraph from a PDF in Sumatra, and the paragraph uses justified formatting that splits words at the end of a line with a hyphen, the hyphen is not copied. For example, if this is the paragraph in the PDF:
The ablest administrator of his day and a devout
Lutheran, he intervened to support the empire’s Prot-
estants.
When it's copied, only this is sent to the clipboard:
The ablest administrator of his day and a devout
Lutheran, he intervened to support the empire’s Prot
estants.
The hyphen is missing. Is this intentional? I'm wondering if there's a way to have the hyphen copied, because when I'm copying from a PDF to a Word doc to make notes, I'd rather have the hyphen there so I can run a tool that removes the line break and the hyphen and joins the two halves of the word. In the example above, that would join "Prot-" and "estants" into "Protestants".
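For reference, the kind of cleanup tool described here could look like the following Python sketch. The function name `join_hyphenated` is hypothetical, and the approach only works when the hyphen actually makes it into the copied text.

```python
import re


def join_hyphenated(text):
    """Join words split across lines with a trailing hyphen.

    Matches a word character, a hyphen, optional spaces or tabs, a line
    break, optional spaces or tabs, and another word character, then
    drops everything between the two word characters.
    """
    return re.sub(r"(\w)-[ \t]*\r?\n[ \t]*(\w)", r"\1\2", text)


copied = (
    "The ablest administrator of his day and a devout\n"
    "Lutheran, he intervened to support the empire’s Prot-\n"
    "estants."
)
print(join_hyphenated(copied))
```

One caveat: a genuine compound that happens to break at its own hyphen (for example "well-" at the end of a line and "known" on the next) would be joined as "wellknown", so the output may still need a quick review.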