Get WPFTS Pro today with 25% discount!

WPFTS Not Recognising Columns in a PDF Document

  • I noticed a peculiarity in the excerpt that appeared as a result of a search I did yesterday.
    When I searched the pdf concerned using my PDF Editor Program, it found four instances of the search term. Dragging my pointer over the text confirmed that the OCR has correctly picked up the two-column layout of the document.
    The excerpt in the WPFTS search results contained two sentences, from the second and third occurrences of the search, so it hadn't generated an excerpt from, particularly, the first occurrence. Is there a reason for this?
    More importantly, the text in the first sentence of the excerpt didn't make sense until I realised that WPFTS had ignored the two-column layout and was reading straight across the page, picking up text from the left and right columns alternately. I think this is a significant bug, maybe in WPFTS or maybe elsewhere, so I'd appreciate a fix.

  • Hi @Nick

    Could you send me an example of that PDF document, please? I need to test it with extraction software, there should be a bug definitely.

  • @EpsilonAdmin I've sent the requested information by email.

  • This post is deleted!

Suggested Topics

Be the first to read the news!

We are always improving our products, adding new functions and fixes. Subscribe now to be the first to get the updates and stay informed about our sales! We are not spammy. Seriously.

Join Us Now!

We are a professional IT-team. Many of us have been working in a Web IT field for more than 10 years. Our advanced experience of software development has been employed in the creation of the WordPress FullText Search plugin. All solutions implemented into the plugin have been used for 5 or more years in over 60 different web-projects.

We are looking forward to your comments, requests and suggestions in relation to the current plugin and future updates.


The forum powered by NodeBB | Contributors