PIH
Do you have any other algorithms to try?
Attachment: Appunti-20200130.png (335.58 KiB)
Tag Backup - SimpleUpdater - XYplorer Messenger - The Unofficial XYplorer Archive - Everything in XYplorer
Don sees all [cit. from viewtopic.php?p=124094#p124094]
Re: PIH
Code:
Set it to a value larger than 0 to define the tolerated distance for two
images to be counted as dupes.
Already tolerance 1 will ignore scale, brightness, contrast, saturation,
and hue. Higher tolerance will ignore JPEG artifacts and minor differences
in detail.
Something like:
1 = ignore scale
2 = ignore brightness
4 = ignore contrast
8 = ignore saturation
Or at least levels 1, 2, 3, 4, and only at 5 "Higher tolerance will ignore JPEG artifacts".
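To make the suggestion concrete, here is a minimal sketch of how such combinable flags could behave. It is only an illustration of the proposal above, not how XYplorer works; the names `Ignore` and `parse_tolerance` are hypothetical.

```python
from enum import IntFlag


class Ignore(IntFlag):
    """Hypothetical bit flags, using the values proposed in the post."""
    SCALE = 1
    BRIGHTNESS = 2
    CONTRAST = 4
    SATURATION = 8


def parse_tolerance(value: int) -> Ignore:
    """Turn a combined integer setting into a set of flags."""
    return Ignore(value)


# Ignoring scale and contrast only would be stored as 1 + 4.
setting = Ignore.SCALE | Ignore.CONTRAST
print(int(setting))
print(Ignore.CONTRAST in parse_tolerance(int(setting)))
print(Ignore.BRIGHTNESS in parse_tolerance(int(setting)))
```

Powers of two are what make the values combinable: each aspect gets its own bit, so any subset maps to a unique number.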
Win 7 SP1 x64 100% 1366x768|1900x1080
-
- Site Admin
- Posts: 63404
- Joined: 22 May 2004 16:48
- Location: Win8.1, Win10, Win11, all @100%
- Contact:
Re: PIH
I see what you mean, but this (partly) sounds easier than it is. At the moment I'm quite satisfied with the current state as "phase one" and would like to publish it to the masses and see what kind of feedback I'll get.
But what about the feedback here in the forum? Does it work as well for you as it does for me? (In my personal photo collection it does a perfect job at the moment.)
FAQ | XY News RSS | XY Bluesky
-
- Site Admin
Re: PIH
Thanks! And did the results look good? Not too many false positives or false negatives?
Now the interesting part: You cannot set the hash method here, this dropdown belongs to the "Content" dupe search above. So you have run the same search two times and got different results! Hmm, there is no learning built in...
But I once saw the same here during my tests. I'll keep an eye on it...
PS: Your second search was much slower??? That cannot be. Or was it actually the first search? (There is heavy caching here, any non-first search will be MUCH faster).
Re: PIH
The visual presentation is good, the groups are easy to tell apart, and the text info is informative. I didn't check all dupes (I don't really want to sort them right now), but I checked about 10-15 groups: no false results at all.
Though you say the dropdown makes no difference, speed probably still depends on how busy the CPU is, because I ran 9 searches here with, so to say, "different methods" and the times differ. I must mention that searches 1-4 ran while I was doing other stuff, and 5-9 ran while I was doing nothing but waiting for results. Also, I can't say that the "next runs" were actually "much faster", though again that may be due to my old laptop.
-
- Site Admin
Re: PIH
I see you have Content and Image ticked. That does not make much sense since Content dupes are logically always also Image dupes. 
Thanks for that insight, which allows me to make my code faster: if Content is ticked then any check for Image dupes can be skipped for logical reasons.
a) if content differs we don't have to check for image since all Dupes filters are AND-ed
b) if content is same then image is same as well
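The reasoning in (a) and (b) can be sketched as a short-circuit. This is only an illustration under assumptions, not the actual XYplorer code: `is_dupe`, `same_content`, and `same_image` are hypothetical names, and the two comparison functions are passed in by the caller.

```python
from typing import Any, Callable

Compare = Callable[[Any, Any], bool]


def is_dupe(a: Any, b: Any, check_content: bool, check_image: bool,
            same_content: Compare, same_image: Compare) -> bool:
    """Decide whether a and b count as dupes when all ticked filters are AND-ed."""
    if check_content:
        # (a) content differs: the AND already fails, image check is pointless.
        # (b) content is the same: the image is the same too.
        # Either way the content result alone decides, so skip the image check.
        return same_content(a, b)
    if check_image:
        return same_image(a, b)
    # Assumption: with neither filter ticked, nothing rules the pair out.
    return True


image_checks = []


def expensive_image_compare(a: Any, b: Any) -> bool:
    image_checks.append((a, b))  # record that the costly path was taken
    return True


print(is_dupe(b"x", b"y", True, True, lambda p, q: p == q, expensive_image_compare))
print(len(image_checks))
```

With both boxes ticked, the image comparison is never called, which is where the speed-up comes from.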

-
- Site Admin
Re: PIH
FYI, you can now search images by similarity to a given image. ATM it's a two-step process:
1. Retrieve image hash of the given image:
Code:
text "<prop #image.hash>";
2. Go to your search location and use this search term (1b116dc5c3c5311b is an example for the hash returned above):
Code:
prop:#image.hash:1b116dc5c3c5311b
Future enhancements could be
- a way to pass a tolerance value to the search (at the moment the default here is tolerance = 1)
- a fully wrapped comfort feature where you point to an image and order "find similar images!"
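For readers wondering what "similar within a tolerance" could mean for a hash like the one above, here is a minimal sketch. It assumes the distance is the number of differing bits (Hamming distance) between two 64-bit hex hashes; the thread does not say how XYplorer actually measures the distance, so treat this as one plausible reading. The file names and the second and third hashes are made up for the example.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count the bits that differ between two hex-encoded hashes."""
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")


def is_similar(hash_a: str, hash_b: str, tolerance: int = 1) -> bool:
    """Assumed rule: similar if the bit distance does not exceed the tolerance."""
    return hamming_distance(hash_a, hash_b) <= tolerance


reference = "1b116dc5c3c5311b"  # example hash from the post
candidates = {
    "a.jpg": "1b116dc5c3c5311b",  # identical hash
    "b.jpg": "1b116dc5c3c5311a",  # differs in one bit
    "c.jpg": "e4ee923a3c3acee4",  # every bit flipped
}

matches = [name for name, h in candidates.items() if is_similar(reference, h)]
print(matches)
```

A tolerance parameter, as mentioned under future enhancements, would simply be the third argument here.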
-
- Site Admin
Re: PIH
PS: Of course, the same syntax also lets you define color filters for similar images, how cool is that?
Code:
prop:#image.hash:1b116dc5c3c5311b
Re: PIH
Well, not very cool. I'm supposed to be managing files and not playing with neat new features. What are you doing to me?
(And yes, wow, this really is a neat feature! I can finally get rid of this other ancient tool I have been using for the longest time.)
-
- Site Admin
Re: PIH
It might even get uncooler. So many things are imaginable now. What about sorting photos by their dominant color hue (like some folks do with their books on the shelf)?
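Just to illustrate the idea, a sketch of sorting by hue, assuming each photo's dominant color is already known as an RGB triple (how that color would be extracted is not covered here, and the file names and colors are invented):

```python
import colorsys


def hue_of(rgb: tuple) -> float:
    """Return the hue (0.0 to 1.0) of an 8-bit RGB color."""
    r, g, b = (channel / 255 for channel in rgb)
    hue, _saturation, _value = colorsys.rgb_to_hsv(r, g, b)
    return hue


# Hypothetical dominant colors, one per photo.
photos = {
    "forest.jpg": (40, 160, 60),   # mostly green
    "sunset.jpg": (230, 90, 30),   # mostly orange
    "sea.jpg": (30, 90, 200),      # mostly blue
}

by_hue = sorted(photos, key=lambda name: hue_of(photos[name]))
print(by_hue)
```

Hue runs around the color wheel from red through green to blue, so the result reads like a rainbow-ordered shelf.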
Re: PIH
Sounds like a neat feature for huge photo archives.

I wonder whether in future it might be possible to hash an image with lots of green (a photo in a forest) and, with some tolerance value, find images that are not similar as such but share the same "lots of green" look? It's just a question out of curiosity.
