It doesn't like like it is searchable. The collection consists of scanned images of each page. But it's very clear, and every jot and tittle is present. I suppose you could, if you chose, download each page.
But the important thing is that every volume published thus far is available.
Bill
FYI
Return-path: owner-EMF-L@list.tcnj.edu Envelope-to: daniel.odonnell@uleth.ca Delivery-date: Thu, 16 Mar 2006 07:55:39 -0700 Received: from bellatrix.uleth.ca ([142.66.3.43]:52643) by bianca.netsrv.uleth.ca with esmtp (Exim 4.52) id 1FJtt5-0007Ec-NA for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:39 -0700 Received: from exim by bellatrix.uleth.ca with spam-scanned (Exim 4.52) id 1FJtt4-0000rR-AL for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:39 -0700 Received: from cyrus.tcnj.edu ([159.91.15.208]:46735) by bellatrix.uleth.ca with esmtp (Exim 4.52) id 1FJtt4-0000rK-2g for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:38 -0700 Received: from localhost (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id 3D1275A52; Thu, 16 Mar 2006 09:55:37 -0500 (EST) Received: from cyrus.TCNJ.EDU ([127.0.0.1]) by localhost (cyrus.TCNJ.EDU [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 02182-04; Thu, 16 Mar 2006 09:55:36 -0500 (EST) Received: from cyrus.TCNJ.EDU (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with SMTP id B00B55A57; Thu, 16 Mar 2006 09:55:36 -0500 (EST) X-Original-To: EMF-L@list.tcnj.edu Delivered-To: EMF-L@list.tcnj.edu Received: from localhost (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id 309D55957 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:23 -0500 (EST) Received: from cyrus.TCNJ.EDU ([127.0.0.1]) by localhost (cyrus.TCNJ.EDU [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 02050-09 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:22 -0500 (EST) Received: from race1.oit.umass.edu (race1.oit.umass.edu [128.119.101.37]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id E0E5B5878 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:22 -0500 (EST) Received: from [128.119.132.67] (bart-67.dhcp.umass.edu [128.119.132.67]) by race1.oit.umass.edu (8.13.4/8.13.4) with ESMTP id k2GEseXU030186 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:19 -0500 Mime-Version: 1.0 (Apple Message framework v746.3) To: EMF-L@list.tcnj.edu Message-Id: 01A179FE-B107-4690-8F09-ED42F9FD2A71@english.umass.edu Content-Type: multipart/alternative; boundary=Apple-Mail-4-1004961613 From: Stephen Harris sharris@english.umass.edu Subject: MGH Date: Thu, 16 Mar 2006 09:55:18 -0500 X-Mailer: Apple Mail (2.746.3) X-Virus-Scanned: by amavisd-new at TCNJ.EDU Reply-To: EMF-L@list.tcnj.edu Sender: owner-EMF-L@list.tcnj.edu X-Listprocessor-Version: 8.2.09/990901/11:28 -- ListProc(tm) by CREN X-Virus-Scanned: by amavisd-new at TCNJ.EDU X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on bellatrix.uleth.ca X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,HTML_90_100, HTML_MESSAGE autolearn=no version=3.0.4
It appears that the MGH is now on-line for free: http://www.dmgh.de/index.htmlhttp://www.dmgh.de/index.html
Stephen
Stephen J. Harris
Associate Professor of Old English
Department of English
Bartlett Hall
University of Massachusetts
Amherst, MA 01003
Digital Medievalist Project Homepage: http://www.digitalmedievalist.org Journal (Spring 2005-): http://www.digitalmedievalist.org/journal.cfm RSS (announcements) server: http://www.digitalmedievalist.org/rss/rss2.cfm Wiki: http://sql.uleth.ca/dmorgwiki/index.php Change membership options: http://listserv.uleth.ca/mailman/listinfo/dm-l Submit RSS announcement: http://www.digitalmedievalist.org/newitem.cfm Contact editorial Board: digitalmedievalist@uleth.ca dm-l mailing list dm-l@uleth.ca http://listserv.uleth.ca/mailman/listinfo/dm-l
To bad the quality of the scans is a little on the low side. With a tiny bit of training, an OCR could produce JSTOR style and quality PDF over Text scans (i.e. not really that great, but better than nothing). And the scanning is the real cost intensive thing if you don't proof.
-d
On Thu, 2006-16-03 at 16:46 -0330, Bill Schipper wrote:
It doesn't like like it is searchable. The collection consists of scanned images of each page. But it's very clear, and every jot and tittle is present. I suppose you could, if you chose, download each page.
But the important thing is that every volume published thus far is available.
Bill
FYI
Return-path: owner-EMF-L@list.tcnj.edu Envelope-to: daniel.odonnell@uleth.ca Delivery-date: Thu, 16 Mar 2006 07:55:39 -0700 Received: from bellatrix.uleth.ca ([142.66.3.43]:52643) by bianca.netsrv.uleth.ca with esmtp (Exim 4.52) id 1FJtt5-0007Ec-NA for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:39 -0700 Received: from exim by bellatrix.uleth.ca with spam-scanned (Exim 4.52) id 1FJtt4-0000rR-AL for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:39 -0700 Received: from cyrus.tcnj.edu ([159.91.15.208]:46735) by bellatrix.uleth.ca with esmtp (Exim 4.52) id 1FJtt4-0000rK-2g for daniel.odonnell@uleth.ca; Thu, 16 Mar 2006 07:55:38 -0700 Received: from localhost (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id 3D1275A52; Thu, 16 Mar 2006 09:55:37 -0500 (EST) Received: from cyrus.TCNJ.EDU ([127.0.0.1]) by localhost (cyrus.TCNJ.EDU [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 02182-04; Thu, 16 Mar 2006 09:55:36 -0500 (EST) Received: from cyrus.TCNJ.EDU (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with SMTP id B00B55A57; Thu, 16 Mar 2006 09:55:36 -0500 (EST) X-Original-To: EMF-L@list.tcnj.edu Delivered-To: EMF-L@list.tcnj.edu Received: from localhost (localhost [127.0.0.1]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id 309D55957 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:23 -0500 (EST) Received: from cyrus.TCNJ.EDU ([127.0.0.1]) by localhost (cyrus.TCNJ.EDU [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 02050-09 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:22 -0500 (EST) Received: from race1.oit.umass.edu (race1.oit.umass.edu [128.119.101.37]) by cyrus.TCNJ.EDU (Postfix) with ESMTP id E0E5B5878 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:22 -0500 (EST) Received: from [128.119.132.67] (bart-67.dhcp.umass.edu [128.119.132.67]) by race1.oit.umass.edu (8.13.4/8.13.4) with ESMTP id k2GEseXU030186 for EMF-L@list.tcnj.edu; Thu, 16 Mar 2006 09:55:19 -0500 Mime-Version: 1.0 (Apple Message framework v746.3) To: EMF-L@list.tcnj.edu Message-Id: 01A179FE-B107-4690-8F09-ED42F9FD2A71@english.umass.edu Content-Type: multipart/alternative; boundary=Apple-Mail-4-1004961613 From: Stephen Harris sharris@english.umass.edu Subject: MGH Date: Thu, 16 Mar 2006 09:55:18 -0500 X-Mailer: Apple Mail (2.746.3) X-Virus-Scanned: by amavisd-new at TCNJ.EDU Reply-To: EMF-L@list.tcnj.edu Sender: owner-EMF-L@list.tcnj.edu X-Listprocessor-Version: 8.2.09/990901/11:28 -- ListProc(tm) by CREN X-Virus-Scanned: by amavisd-new at TCNJ.EDU X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on bellatrix.uleth.ca X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,HTML_90_100, HTML_MESSAGE autolearn=no version=3.0.4
It appears that the MGH is now on-line for free: http://www.dmgh.de/index.html
Stephen
Stephen J. Harris
Associate Professor of Old English
Department of English
Bartlett Hall
University of Massachusetts
Amherst, MA 01003
Digital Medievalist Project Homepage: http://www.digitalmedievalist.org Journal (Spring 2005-): http://www.digitalmedievalist.org/journal.cfm RSS (announcements) server: http://www.digitalmedievalist.org/rss/rss2.cfm Wiki: http://sql.uleth.ca/dmorgwiki/index.php Change membership options: http://listserv.uleth.ca/mailman/listinfo/dm-l Submit RSS announcement: http://www.digitalmedievalist.org/newitem.cfm Contact editorial Board: digitalmedievalist@uleth.ca dm-l mailing list dm-l@uleth.ca http://listserv.uleth.ca/mailman/listinfo/dm-l
I was just on my way out of office when I received the mails about the MGH online.
As this is most probably my first mail to this list, allow me to introduce myself: My name is Clemens Radl and I am working at the MGH in Munich and am responsible for the digital MGH online (dMGH http://www.dmgh.de).
Right now I don't have the time to go into a lot of details. So this is not a formal announcement of our project. I just wanted to answer some of the points brought up by you.
On Thu, Mar 16, 2006 at 01:27:23PM -0700, Daniel O'Donnell wrote:
To bad the quality of the scans is a little on the low side. With a tiny bit of training, an OCR could produce JSTOR style and quality PDF over Text scans (i.e. not really that great, but better than nothing). And the scanning is the real cost intensive thing if you don't proof.
It's true that the quality of the images presented on the web is not as good as I wish it were. But this is only a preliminary presentation within a framework that was *not* specifically designed for the dMGH. But we wanted to provide access to the scanned images as soon as possible.
The volumes have been scanned with 600 dpi and theses high quality images will be used for OCR. In fact we have already OCR'ed all of the Diplomata volumes and right now we are working at the Epistolae. And we are making good progress.
The software which will allow full text searches is in the final stages of preparation. At least a prototype will be made available soon (I'll refrain from mentioning a specific date, though).
In fact, we are not going to do any proofreading. The images will always be the main source of information. In the background you can do full text searches and the results will be presented in the images with highlighting (roughly comparable to Google Print).
On Thu, 2006-16-03 at 16:46 -0330, Bill Schipper wrote:
It doesn't like like it is searchable. The collection consists of scanned images of each page. But it's very clear, and every jot and tittle is present. I suppose you could, if you chose, download each page.
But the important thing is that every volume published thus far is available.
We have an agreement with our publishers that we will respect a five year boundary, i. e. new volumes will only appear on our web site five years after publication ("moving wall"). But it's true that the overwhelming majority of our printed editions are available online.
Unfortunately, we right now do not have an English translation of our web page, yet. Maybe I'll get to enter some information about the project, our background and our schedule into the dm-l wiki these days ...
But right now I have to go home and get some sleep ;-)
Regards,
Clemens
On Thu, 2006-16-03 at 22:10 +0100, Clemens Radl wrote:
Maybe I'll get to enter some information about the project, our background and our schedule into the dm-l wiki these days
Ahhh an idea after my own heart. I can tell you it is not something you will regret!
-d
...
But right now I have to go home and get some sleep ;-)
Regards,
Clemens