Poster: | Tom Gally | Date: | Dec 11, 2009 6:20pm |
Forum: | texts | Subject: | Missing images from Google book scans |
http://www.archive.org/search.php?query=%28collection%3Aamericana%20AND%20format%3Apdf%20AND%20mediatype%3Atexts%29%20AND%20-mediatype%3Acollection&sort=-publicdate
http://www.archive.org/search.php?query=-description%3A%28Google%29%20AND%20%28collection%3Aamericana%20AND%20format%3Apdf%20AND%20mediatype%3Atexts%29%20AND%20-mediatype%3Acollection&sort=-publicdate
The first URL lists all of the most recent additions to the Americana collection, while the second URL excludes those taken from Google (nearly all added, it seems, by user tpb, who I presume is a bot). Even though it returns far fewer books, I prefer the latter URL because, too often, the scans from Google are bad in multiple ways: folded-over pages, visible fingers, resolution too coarse for the text to be read, and, worst of all, omitted images.
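For anyone who wants to adapt these searches, here is a minimal sketch of how the two URLs appear to be put together. It only decodes and re-encodes the query strings shown above; the names `BASE`, `AMERICANA`, and `search_url` are hypothetical helpers made up for illustration, not part of any Internet Archive API.

```python
from urllib.parse import quote

# Endpoint and query text are taken from the two URLs above;
# the helper names are assumptions made for this sketch.
BASE = "http://www.archive.org/search.php"
AMERICANA = (
    "(collection:americana AND format:pdf AND mediatype:texts) "
    "AND -mediatype:collection"
)


def search_url(query, sort="-publicdate"):
    """Percent-encode the query and attach the sort order (newest first)."""
    return f"{BASE}?query={quote(query, safe='')}&sort={sort}"


# First URL: all recent Americana PDF texts.
all_recent = search_url(AMERICANA)

# Second URL: the same search, minus items whose description mentions Google.
no_google = search_url("-description:(Google) AND " + AMERICANA)

print(all_recent)
print(no_google)
```

The only difference between the two is the leading `-description:(Google)` clause, so other exclusions could presumably be added the same way.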
Examples of missing images, all from books taken from the former URL a few minutes ago, can be seen here:
http://www.archive.org/stream/economicmininga01lockgoog#page/n410/mode/2up
http://www.archive.org/stream/edinburghphilos10edingoog#page/n381/mode/2up
http://www.archive.org/stream/earthadescripti01reclgoog#page/n34/mode/2up
Presumably Google removed these images in order to improve the accuracy of OCR conversion, but it's a shame that these files are being added in such large numbers to the Internet Archive when, one would hope, better scans are available somewhere.
Does anyone know why these defective versions, rather than versions with the illustrations intact, are being added? Do the libraries at which the books were scanned (Harvard University, University of California, etc.) know that defective versions of their books are being added to the Internet Archive? Can anything be done to replace those scans with better ones?
Reply
Poster: | stringybark | Date: | Dec 11, 2009 7:15pm |
Forum: | texts | Subject: | Re: Missing images from Google book scans |
Reply
Poster: | stbalbach | Date: | Dec 11, 2009 10:09pm |
Forum: | texts | Subject: | Re: Missing images from Google book scans |
Stephen
Reply
Poster: | Time Traveller | Date: | Dec 11, 2009 11:10pm |
Forum: | texts | Subject: | Re: Missing images from Google book scans |
It may be that, with it all in the news media, authors are opting out because they did not know before how Google was using their book(s).
Peter
Reply
Poster: | Time Traveller | Date: | Dec 11, 2009 10:13pm |
Forum: | texts | Subject: | Re: Missing images from Google book scans |
But they are sort of profit-making, while volunteers do the scanning for the IA: a labour of love as opposed to a salary from Google.
Peter