We have over 1,000 pdf documents on our website, and the most important is our Journal, with 238 editions at present. As an example, a couple of issues of our Journal are here, but they are all similar in structure:
You will see that at the bottom of the second page of Journal 100 there is a table of contents for the issue and, if the PDF is saved and then opened in Acrobat Reader, there is an equivalent set of bookmarks. It would help if a search term appearing in an article title (as listed in the table of contents) were given a higher weighting than one in the text of the document, but quite often the term will only occur in the text, and not in the title at all.
Ideally, we would like the weighting to be based upon articles, but a weighting based upon the whole Journal is acceptable. This is because the articles within a Journal tend to be on unrelated topics, so there is probably little difference between the number of instances of a specific search term within an article and within its parent document.
I think it would help me if you could explain, in non-technical terms, how the four weightings operate in WPFTS. I've looked at the TF-IDF article on Wikipedia and understand the basics, but the majority of the article is too technical for me. Perhaps this information could also be added to the WPFTS documentation?
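For anyone following along, here is a minimal sketch of the general idea behind a TF-IDF score with a title boost. It is only an illustration of the technique, not how WPFTS actually calculates relevance: the function names, the sample articles, and the weights (3.0 for the title, 1.0 for the body) are all made up for the example, and the IDF formula is just one common smoothed variant.

```python
import math
from collections import Counter

PUNCTUATION = ".,;:!?()\"'"


def tokenize(text):
    """Split text into lowercase words with surrounding punctuation removed."""
    words = (w.strip(PUNCTUATION).lower() for w in text.split())
    return [w for w in words if w]


def title_boosted_tfidf(term, docs, title_weight=3.0, body_weight=1.0):
    """Score each document for a single search term.

    docs is a list of dicts with 'title' and 'body' keys (hypothetical
    structure). The weights are illustrative assumptions, not WPFTS defaults.
    """
    term = term.lower()
    # Term frequency: how often the term occurs in the title and in the body.
    counts = [
        (Counter(tokenize(d["title"]))[term], Counter(tokenize(d["body"]))[term])
        for d in docs
    ]
    # Document frequency: how many documents contain the term anywhere.
    df = sum(1 for in_title, in_body in counts if in_title + in_body > 0)
    # Inverse document frequency: rarer terms count for more (smoothed variant).
    idf = math.log((1 + len(docs)) / (1 + df)) + 1.0
    # A title hit is worth title_weight body hits.
    return [
        (title_weight * in_title + body_weight * in_body) * idf
        for in_title, in_body in counts
    ]


# Invented sample articles, standing in for three articles in one Journal.
articles = [
    {"title": "Restoring the canal lock",
     "body": "The lock gates were replaced last spring."},
    {"title": "Annual meeting report",
     "body": "Members discussed the canal lock and the lock keeper's cottage."},
    {"title": "Bird survey",
     "body": "Herons were seen near the river."},
]

scores = title_boosted_tfidf("lock", articles)
for article, score in zip(articles, scores):
    print(f"{score:.3f}  {article['title']}")
```

In this sketch the first article ranks highest because the term appears in its title as well as its text, the second still scores because the term appears twice in its text, and the third scores zero. Scoring per article rather than per Journal would simply mean passing individual articles in as the documents.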
As a related issue, could WPFTS open the document listed in the search results at the first article page where the search term is found? I realise it might go to the article title instead, and if so that wouldn't really help much.
PDFs not appearing in search results
I've been using this extension since the summer, and with great success. However, recently I discovered that PDF files (or documents with PDF attachments) will no longer appear in search results. This happens with "Limit File Types" set to "Allow All", and both with and without "Search in File Contents" enabled. Other files, such as pictures (JPG, JPEG) appear with file contents enabled. The search URL I'm using is
I have tried the same search term in the sandbox area in the admin panel, and here everything works as expected. The document with the PDF attachment appears as the third search result.
I suspect this happened in an update to the plugin since, as far as I know, no configuration change was made. The current version installed is v2.46.180.
Does anyone have any idea of what's going on?
Do you remember which plugins you've installed recently? Some of them may conflict with WPFTS Pro, especially when a search is run on the front end. Please try disabling those plugins temporarily.
If that does not help, let's connect with TeamViewer so I can solve this issue faster (send me a private message or use Live Chat on the main page).