mark searchable PDF files
Hi,
Is XYplorer able to set a marker so that the user can distinguish searchable from non-searchable PDF files?
For example, by creating a column "searchable" and putting an S (or any marker) on each line for searchable PDF files?
(and not setting a marker for non-searchable ones)
Explanation:
I have lots of PDF files; some of them are searchable, some are not.
I want to get an overview of searchable/non-searchable files without opening each file manually and checking whether it's searchable.
The main goal behind this question is that I want to make all my PDF files searchable, but to do that I first need an overview of which PDF files are already searchable and which are not.
Is XYplorer able to help me?
Thanks,
Mathijs
Re: mark searchable PDF files
I thought your question was already answered nicely on the Total Commander forum?
(short answer: Use pdfOCR)
I wrote something for that a while ago. With some minor modifications, that could be adapted to your use case and generate a list of filenames (including path) that need OCR.
Re: mark searchable PDF files
Get the Xpdf command line tools and use a custom column like:
Code:
$tool = "D:\some path\Xpdf tools_x64\bin64\pdftotext.exe";
$output = trim(runret("""$tool"" -simple -nopgbrk ""<cc_item>"" -", %TEMP%, 65001), <crlf>, "R");
if ($output) { return "S"; }
Why is XY's internal SC extracttext() not used? Because it can throw an unavoidable script error on non-searchable PDFs...
One of my scripts helped you out? Please donate via Paypal
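For anyone who wants a plain list of files that still need OCR rather than a column in XY, the same pdftotext check can be sketched outside XYplorer. This is only a rough Python sketch and not something posted in this thread: the function names (has_text, is_searchable, needs_ocr) are made up for illustration, the tool path is the placeholder from the script above, and whitespace-only output is treated as "not searchable", which is slightly broader than the trim on <crlf> in the custom column.

```python
from __future__ import annotations

import subprocess
from pathlib import Path

# Placeholder path copied from the custom column script; adjust to your install.
PDFTOTEXT = r"D:\some path\Xpdf tools_x64\bin64\pdftotext.exe"


def has_text(extracted: str | bytes) -> bool:
    """Return True if the extracted output contains anything but whitespace."""
    return bool(extracted.strip())


def is_searchable(pdf: Path, tool: str = PDFTOTEXT) -> bool:
    """Run pdftotext on one file and report whether any text came out."""
    result = subprocess.run(
        # Same flags as the custom column; the trailing "-" sends text to stdout.
        [tool, "-simple", "-nopgbrk", str(pdf), "-"],
        capture_output=True,
    )
    # Assumption: a failing run (damaged or protected file) counts as "no text".
    if result.returncode != 0:
        return False
    return has_text(result.stdout)


def needs_ocr(folder: Path, tool: str = PDFTOTEXT) -> list[Path]:
    """List every *.pdf below folder for which no text could be extracted."""
    return [p for p in sorted(folder.rglob("*.pdf")) if not is_searchable(p, tool)]
```

Calling needs_ocr(Path(r"D:\scans")) (a hypothetical folder) would return the paths that still need OCR; the searchable files are simply left out, like the empty column value in the script.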
Re: mark searchable PDF files
highend wrote: ↑03 Mar 2021 19:25
Get the Xpdf command line tools and use a custom column like: [...]
I would like to use this script and have defined custom column 16 with it.
But I don't understand the help file on the topic "how to find files with custom columns".
Where in the Find Files fields do I have to enter cc16:s, which is my understanding of what to search for?
[Edit]
I found that it works in Quick Search, but I would still like to know how it can be done with the Find Files dialog.
[Edit]
Found it by further reading the help file.
So I can enter things like !cc16:s in the name field.
I don't find this very intuitive or logical.
Windows 11 Home, Version 25H2 (OS Build 26200.7171)
Portable x64 XYplorer (Actual version, including betas)
Display settings 1920 x 1080 Scale 100%
Everything 1.5.0.1400a (x64), Everything Toolbar 2.1.0, Listary Pro 6.3.6.99
Re: mark searchable PDF files
notabot wrote: ↑03 Mar 2021 19:19
I thought your question was already answered nicely on the Total Commander forum?
(short answer: Use pdfOCR)
I wrote something for that a while ago. With some minor modifications, that could be adapted to your use case and generate a list of filenames (including path) that need OCR.
My question was if XYplorer is able to do that by itself. So no plugins etc.
Re: mark searchable PDF files
mgroen wrote: ↑04 Mar 2021 12:13
My question was if XYplorer is able to do that by itself. So no plugins etc.
The same answer as you got in the Total Commander forum.
Why should a file manager have such a special function in native code?
And what's the problem with using highend's script in XYplorer? It works perfectly for me.
Re: mark searchable PDF files
I can't see any specific "XY should handle this without external tools" part in the initial question...
Apart from that, if Don implements: viewtopic.php?f=5&t=22805
you can use extracttext() instead of the external Xpdf tool...
Re: mark searchable PDF files
Horst wrote:
The same answer as you got in Total Commander forum.
Why should a file manager have such a special function in native code ?
And whats the problem using highends script in XYplorer, it works perfectly for me.
Because it's about FILES and we are talking about a FILE manager.
Scripting is very cumbersome compared to an application that has the functionality built in. It is also very error-prone; that's why I am looking for a good file manager with this functionality built in. Also, is a Total Commander script not usable in XYplorer? (but this is a question from my side)
Re: mark searchable PDF files
mgroen wrote: ↑04 Mar 2021 14:14
Because its about FILES and we are talking about a FILE manager. [...] Also, TotalCommander script is not usable for XYplorer? (but this is a question from my side)
There is currently no script for this purpose in Total Commander.
But the script above from highend shows that such scripting can be done more easily in XYplorer.
Re: mark searchable PDF files
And now with v21.50.0130 you can just use:
Code:
$output = trim(extracttext(<cc_item>, , 1), <crlf>, "R");
return $output ? "S" : "";
Re: mark searchable PDF files
This doesn't work at all for me.
It's extremely slow with the SumatraPDF iFilter,
and Quick Search returns all 220 of my tested PDF files as not searchable.
But only 26 of them are not searchable.
This is independent of the iFilter software used; tested with SumatraPDF and TETPDFiFilter.
The pdftotext solution is fast and delivers correct results regardless of the iFilter software used.
Last edited by Horst on 04 Mar 2021 18:06, edited 2 times in total.
Re: mark searchable PDF files
Works fine here and all pdfs are classified correctly.
What do these commands yield on one of those pdfs?
Code:
text extracttext(, 32);
text extracttext(, 64);
Re: mark searchable PDF files
highend wrote: ↑04 Mar 2021 17:39
Works fine here and all pdfs are classified correctly.
What do these commands yield on one of those pdfs?
Code:
text extracttext(, 32);
text extracttext(, 64);
text extracttext(, 32); always gives an error.
text extracttext(, 64); always delivers an empty output, regardless of whether the PDF is searchable or not.
Re: mark searchable PDF files
Attach that pdf here (zipped)...
Re: mark searchable PDF files
Attached 2 examples.
The file "Drive Snapshot - Tips und Tricks.pdf" is searchable
The file "Drive Snapshot - Kommandozeile.pdf" is not searchable
Tested by trying to select text with SumatraPDF.
Attachments: files.zip (968.38 KiB)