Poster: | Victor3 | Date: | May 16, 2009 3:51pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
This post was modified by Victor3 on 2009-05-16 22:51:07
Poster: | hank_b | Date: | May 27, 2009 4:04pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
User tpb uploaded a host of wonderful texts. Thank you! But for many of them the data are incomplete. Sometimes the title is incomplete, the volume number is missing, or the author name is missing. Example: http://www.archive.org/details/modernephilosop00unkngoog .
========
We are in the midst of a clean-up pass over *all* the public-domain Google books that have been contributed to the Archive, some of which, as you note, arrived with incomplete metadata. Take another look at modernephilosop00unkngoog: it was revisited on May 21, at which point info on author and contributing library was found and added to our copy.
In the past week we've added author info to 25,000 of the Google books for which we had none, and contributor info to 15,000 Google books that lacked it. The number of our Google books with no title given has dropped from 8400 to ~100.
There are no doubt many remaining problems in a collection this size, but bit by bit, we are working to make improvements.
Hank Bromley
software engineer
Internet Archive
Poster: | Victor3 | Date: | May 28, 2009 5:57am |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
It would also be desirable:
* that the YEAR appear in an extra field, in the title, or in the description;
* that the VOLUME NUMBER be added to the title. (For example, search for:
title:(mikrokosmus) AND creator:(lotze)
15 items are found, but one cannot tell which item is which of the three volumes.)
* that the complete title with SUBTITLE appear in the description (or in an extra field for the subtitle).
Poster: | hank_b | Date: | May 28, 2009 5:28pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
This post was modified by hank_b on 2009-05-29 00:28:34
Poster: | Victor3 | Date: | May 30, 2009 7:22am |
Forum: | texts | Subject: | Missing volume numbers; long subtitles |
This post was modified by Victor3 on 2009-05-30 14:22:30
Poster: | hank_b | Date: | May 30, 2009 4:19pm |
Forum: | texts | Subject: | Re: Missing volume numbers; long subtitles |
We're now up to 32,089 Google books with volume info (being added at about 13,000/day).
Poster: | bookdev | Date: | Oct 6, 2009 5:17pm |
Forum: | texts | Subject: | Re: How to download all titles |
Poster: | Time Traveller | Date: | Oct 6, 2009 7:01pm |
Forum: | texts | Subject: | Re: How to download all titles |
Just why do you want to download all that, seeing that the stuff is just as accessible when left on the Archive?
You are talking about having it on your PC, so what will happen when, 2 hours later, your database is out of date, seeing that uploads are happening every minute, 24/7?
Poster: | stbalbach | Date: | Oct 6, 2009 8:48pm |
Forum: | texts | Subject: | Re: How to download all titles |
Stephen
Poster: | bookdev | Date: | Oct 7, 2009 12:32pm |
Forum: | texts | Subject: | Re: How to download all titles |
"Search engine returned invalid information or was unresponsive. We are working to resolve this issue."
Poster: | stbalbach | Date: | Oct 7, 2009 2:00pm |
Forum: | texts | Subject: | Re: How to download all titles |
Poster: | Victor3 | Date: | May 28, 2009 4:33pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
* But if the year field for these books is complete, why does this information not show up in the normal search
http://www.archive.org/search.php?query=title%3A(mikrokosmus)%20AND%20creator%3A(lotze) ? That would be very useful!
* The presentation of the search results as an HTML table does not work in my Firefox 3.0.10 (nor in IE). The data are not shown at all. Can you tell me why? The XML format works.
* The volume information could go into the title or into the volume field, or both. Both would work equally well for me.
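For reference, the percent-encoded query in the search URL above can be reproduced with a few lines of Python. This is only a sketch: the helper name basic_search_url is made up for illustration, and the search.php endpoint is simply copied from the URL quoted in this post.

```python
from urllib.parse import quote

# Endpoint copied from the URL quoted in the post above.
SEARCH_ENDPOINT = "http://www.archive.org/search.php"


def basic_search_url(query: str) -> str:
    """Return a basic-search URL for a fielded query (hypothetical helper)."""
    # Keep parentheses literal; ':' becomes %3A and spaces become %20.
    return SEARCH_ENDPOINT + "?query=" + quote(query, safe="()")


print(basic_search_url("title:(mikrokosmus) AND creator:(lotze)"))
```

The printed URL should match the one quoted above, which makes it easy to swap in another title or creator.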
Poster: | hank_b | Date: | May 28, 2009 5:38pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
The basic search only displays a certain fixed set of fields; no matter how we tweak that set, something that someone wants to see included will be left out. Thus the advanced search: you get to choose which fields you want to see.
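As a rough illustration of choosing your own fields, the sketch below builds an advanced search URL for the mikrokosmus query discussed in this thread. The endpoint path advancedsearch.php and the parameter names q, fl[], rows and output are my assumptions about how the advanced search form submits its request, and the helper name advanced_search_url is hypothetical; check the form itself before relying on any of them.

```python
from urllib.parse import urlencode

# Assumed endpoint for the advanced search; not confirmed in this thread.
ADVANCED_SEARCH = "http://www.archive.org/advancedsearch.php"


def advanced_search_url(query, fields, rows=50, output="json"):
    """Build an advanced-search URL that returns only the chosen fields.

    Hypothetical helper: parameter names are assumptions, see the note above.
    """
    params = [("q", query)]
    # One fl[] parameter per field to be shown in the results.
    params += [("fl[]", field) for field in fields]
    params += [("rows", rows), ("output", output)]
    return ADVANCED_SEARCH + "?" + urlencode(params)


url = advanced_search_url(
    "title:(mikrokosmus) AND creator:(lotze)",
    ["identifier", "title", "year", "volume"],
)
print(url)
```

Asking for year and volume alongside title is what would let you tell the volumes of a multi-volume work apart, provided those fields have been filled in for the items in question.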
I've implemented volume-seeking, and volume info is now being added as we process (or reprocess) Google books. So far we've climbed from 15 Google books with volume info to 30.
Poster: | Victor3 | Date: | May 28, 2009 9:10pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
I just saw that now the year always appears in the detail window of every book, in brackets behind the title. If I am not mistaken, that was not so a few weeks ago.
* I just realised how many new texts there are in Google in my field of interest (philosophy in German). It is a lot of work to download and upload them. I hope others, or the team, are working on this too and have a more efficient way of doing it.
Poster: | hank_b | Date: | May 28, 2009 9:43pm |
Forum: | texts | Subject: | Re: How to correct a title already uploaded? |
So for periodicals, "date" is when the series began (or sometimes the range of years the series covered), while "year" will be when a single issue was published. See, for instance, http://www.archive.org/details/ntsiklopediches01andrgoog .
Books usually have a single publication date, used for both "date" and "year", although that, too, can be complicated by multiple editions, translations, etc.