Skip to content

Text copying produces broken/spaced characters + inaccurate text selection (Turkish PDF) #5627

Description

@96273

I'm experiencing two related issues when working with a Turkish-language PDF in SumatraPDF.

  1. Copied text has excessive spaces between characters
    When I select and copy text, it is pasted with spaces inserted between almost every character (and sometimes between syllables).

Example:
Original text (should be):

HP VE PSÖRİAZİS
Kronik, immün aracılı inflamatuar bir hastalık olan psöriazisin Hp enfeksiyonu ile tetiklenebileceği düşünülmüş...

Copied text (actual):

HP VE PSÖ Rİ A ZİS
Kro nik, im mün ara cı lı inflamatuar bir has ta lık olan
psö ri a zi sin Hp en fek si yo nu ile te tik le ne bi le ce ği dü-
şü nül müş ol mak la be ra ber ça lış ma lar çe liş ki li so-
nuç lar gös ter miş tir.

  1. Text selection is inaccurate / overshoots

When trying to select a specific paragraph or line, SumatraPDF often selects extra text from the line above and/or below. It is very difficult to select only the desired text block (see attached screenshots).
PDF used:
https://www.turkiyeklinikleri.com/pdf/?pdf=829d05d6c3a204658233bc43fd943779


Screenshots:

Image Image

Environment:

SumatraPDF version: [3.6.1]
Windows version: [Windows 10 22H2]

Could you please investigate this? Accurate text selection and copying is very important for academic work, especially when dealing with non-English documents.
Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions