Title: Case-insensitive search issue with Turkish characters (İ / i) #5597

@96273

I would like to report a possible issue with the search functionality in SumatraPDF.

While searching within a PDF document, I noticed that case-insensitive search does not seem to work correctly for certain non-ASCII characters, specifically Turkish letters.

Description of the issue

In the document below, there is a word: "İbradı" (capital dotted "İ").

When I search for "ibradı" (all lowercase), the word is not found.
When I search for "İbradı" (with uppercase "İ"), it is found immediately.

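This suggests that the search does not apply Turkish-aware case folding: in Turkish, the lowercase form of the dotted capital "İ" (U+0130) is the plain "i" (U+0069), so the two spellings should match in a case-insensitive search.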
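To illustrate why a locale-independent comparison can miss this match, here is a small Python sketch. It only shows the default Unicode case mappings as exposed by Python's `str.lower()` and `str.casefold()`; it is not a description of how SumatraPDF implements search, which I have not inspected.

```python
# Default (locale-independent) Unicode case mappings, as implemented by Python.
needle = "ibradı"      # what the user types
haystack = "İbradı"    # what the PDF contains (starts with U+0130)

# Lowercasing U+0130 without Turkish tailoring yields TWO code points:
# "i" (U+0069) followed by COMBINING DOT ABOVE (U+0307), not a plain "i".
lowered_i = "\u0130".lower()
print([hex(ord(c)) for c in lowered_i])   # ['0x69', '0x307']

# So neither lower() nor casefold() makes the two spellings compare equal.
lower_match = needle.lower() == haystack.lower()
fold_match = needle.casefold() == haystack.casefold()
print(lower_match, fold_match)            # False False
```

Any search that relies on this kind of untailored mapping, or on a simple one-to-one case table that has no entry for U+0130, would behave as described above.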

Steps to reproduce

1. Open the following PDF:
   https://cdniys.tarimorman.gov.tr/api/File/GetFile/330/Sayfa/1416/1778/DosyaGaleri/21._orman_kaynaklarindan_faydalanma.pdf

2. Search for "ibradı": no results.

3. Search for "İbradı": result found.

Expected behavior

The search should ideally be case-insensitive (unless explicitly set otherwise) and correctly handle Unicode characters such as Turkish dotted/dotless "i".
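As a rough sketch of what such handling could look like, the Python snippet below builds comparison keys with the Turkish mappings (İ to i, I to ı) applied before generic case folding. The helper names `turkish_fold` and `loose_fold` are hypothetical and only meant to illustrate the idea; an actual fix in SumatraPDF would need to fit its own search code.

```python
import unicodedata


def turkish_fold(text: str) -> str:
    """Hypothetical helper: case-fold using Turkish rules (İ -> i, I -> ı)."""
    # Compose first, so a decomposed "I" + U+0307 becomes the single U+0130.
    text = unicodedata.normalize("NFC", text)
    # Apply the two Turkish-specific mappings before the generic folding.
    text = text.replace("\u0130", "i").replace("I", "\u0131")
    return text.casefold()


def loose_fold(text: str) -> str:
    """Hypothetical helper: additionally treat dotless ı like i."""
    # Useful when the document language is unknown: "INDEX" still matches
    # "index", at the cost of no longer distinguishing ı from i.
    return turkish_fold(text).replace("\u0131", "i")


print(turkish_fold("İbradı") == turkish_fold("ibradı"))   # True
print(turkish_fold("INDEX") == turkish_fold("index"))     # False
print(loose_fold("INDEX") == loose_fold("index"))         # True
```

The strict variant follows Turkish orthography, so it would break searches in English text where "I" and "i" are the same letter. The loose variant is one possible compromise for a viewer that cannot know the document language.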

Actual behavior

Search appears to be case-sensitive or not fully Unicode-aware for this character.

Additional materials

Screenshot: [two images attached to the original issue]

I am not sure whether this is an intended limitation or a bug, but I wanted to report it in case it can be improved.

Thank you for your work on SumatraPDF!
