The filter compares the files in a source folder against the target folder by hash, then copies every file whose hash is not already present in the target folder.
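The hash comparison described above could look roughly like the sketch below. It is not the code of `PDF_filter.py`: the names `file_hash` and `copy_missing`, the choice of SHA-256, and the renaming of files on a name clash are all assumptions made for illustration.

```python
import hashlib
import shutil
from pathlib import Path


def file_hash(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()  # assumption: the real script may use another hash
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_missing(source_dir, target_dir):
    """Copy files from source_dir whose content hash is absent from target_dir.

    Hypothetical helper: returns the names of the files that were copied.
    """
    source, target = Path(source_dir), Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)

    # Hashes of everything already in the target folder (top level only).
    seen = {file_hash(p) for p in target.iterdir() if p.is_file()}

    copied = []
    for path in sorted(source.iterdir()):
        if not path.is_file():
            continue
        h = file_hash(path)
        if h in seen:
            continue  # same content already exists in the target
        dest = target / path.name
        if dest.exists():
            # Same name but different content: keep both files.
            dest = target / f"{path.stem}_{h[:8]}{path.suffix}"
        shutil.copy2(path, dest)
        seen.add(h)
        copied.append(dest.name)
    return copied
```

Calling `copy_missing("incoming", "library")` with two hypothetical folder names would copy only the files whose content is new to `library`. Because the comparison is on content rather than on file name, a renamed duplicate is skipped as well.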
$ python PDF_filter.py

Just for fun. The crawler only downloads images from http://www.dbmeinv.com
$ python dbmeinv_crawler.py