GPO/GODORT Conf Call Minutes Posted

Bill Sleeman, chair of the ALA Government Documents Roundtable (GODORT), recently posted the minutes to the GPO / GODORT Steering Conference Call of March 12, 2008. These conference calls take place from time to time and often have news of value. The minutes can be found (may have to scroll) at http://wikis.ala.org/godort/index.php/GODORT_Chair and covered the following topics, among others:

  • Request for Information for Mass Digitization Opportunities
  • Status of EPA Web Harvesting
  • Status of the Federal Digital System (FDSys)
  • Addition of pre-1976 cataloging to the Catalog of Government Publications - in progress.
  • Continued distance ed through OPAL
  • Current stats on the newish Government Information Online reference service.

If I were you, I'd look over the entire set of minutes as it was all interesting. I'd like to highlight two issues, both of which cry out for the documents community to do more to support GPO in some of its efforts:

EPA Web Harvest Project

Here are the notes on this subject (full names available from minutes page):

LH & RHM: Status of EPA harvesting project: GPO worked through 300 of the documents to gather information on what it will take for GPO to provide access to harvested materials (process, workflow and staffing implications). So far: the back end automation of meta-data extraction is not ready; parameters for metadata that accompanies the files needs improvement to automate de-duping; and the rules, methods and mechanisms for harvesting need to be refined (approximately 28% of material was not in scope). Basically, it is still taking more staff time to make these available than GPO can afford. BS asked about the FGI taxonomy experiment and if GPO would be investigating the results of that effort. GPO may incorporate that information into the project as the project moves forward.

GPO's results of automated harvesting finding a lot of out of scope material and difficult automated extraction of metadata are about what I expected based on my own experience and from my reading of the literature. Whether or not GPO builds on our modest taxonomy experiment (Thanks Bill!), I think that a GPO - community/citizen collaboration will be needed to begin getting a handle on web-based agency documents. They could start simply by publishing their spidering logs and see what happens. Or perhaps they can obtain some of the $2 Billion/week currently being spent elsewhere. If GPO choose to take the mass collaboration route, I hope the documents community is in the forefront of helping them.

If you're interested in taking part in our tagging experiment, please see http://freegovinfo.info/epatagging. We will be running the project through April 18, 2008. To see what has been tagged so far, please visit http://del.icio.us/tag/epapilotproject.

OPAL Training

Here are the notes on this subject:

LC: OPAL, GPO continues to use OPAL for online training and demos. At present, technical capabilities limit presentations to slide shows, such as PowerPoint presentations. Interactive web functions will be added in the future. January call for participation in creation of tutorials netted one submission; hoping to generate interest at DLC.

The FDLP has over 1200 libraries and GPO got ONE SUBMISSION? A majority of FDLP libraries are teaching oriented academic libraries and GPO got ONE SUBMISSION?

Hello! I know I'm not the only one who has insisted that GPO provide training between conferences for those of us who don't get out much. The documents community has a great reservoir of government information expertise. We should be actively aiding GPO in their efforts to spread that expertise.

I admit that GPO's one submission wasn't from my library. I have a pretty new docs staff that's still getting up to speed. But that can't be the case everywhere. If only 10% of FDLP libraries could step up with a program, that would still be 120 programs -- twice a week for a whole year.

Just so I can at least pretend to put my money (or staff time) where my mouth is, I will spend some time next month looking at our library's gov info information strengths, our customer needs and patron interests. And then sometime during the summer I or someone else from our library will submit a program. If you run a depository, will you commit to doing the same? Not only does GPO need our help, so do our colleagues.

FGI thanks the GODORT and GPO personnel who participated, Jill Vassilakos-Long for taking the minutes and Bill for posting them to the ALA GODORT Wiki.

No votes yet

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Question about scope

Is there more information about what is "in scope" and what is "not in scope"? (I assume this is based on USCode, Title 44 defintions.) This concerns me for two reasons:

- Is the application of title 44 scope to web harvesting necessary? (Will anyone ever try to enforce a narrow reading of scope in a web-based environment?)

- Is the application of a title 44 scope to web harvesting appropriate or useful? (Was the law written to deal with non-web-based information and will application of its definitions be counter to the spirit, if not the letter, of the law?)

GPO Scope Criteria

GPO laid out their harvesting criteria in the document:

CRITERIA AND PARAMETERS FOR GPO’S WEB HARVESTING PILOT PROJECT
http://www.gpo.gov/su_docs/fdlp/harvesting/criteria.pdf

Pages 3-4 offer this statement of scope for the EPA web harvest project:

SCOPE OF PUBLICATIONS TO BE HARVESTED
Publications to be harvested are those issued by EPA and within scope of the Federal Depository Library Program and the National Bibliography of U.S. Government Publications.

Scope of the Federal Depository Library Program

The scope of the FDLP includes all published Federal government information products, regardless of format or medium, which are of public interest or educational value, except for those products which are for strictly administrative or operational purposes, classified for reasons of national security, or the use of which is constrained by privacy considerations.

Included in the FDLP are publications created as a result of U.S. Government funded contract or grant. Included in the front matter of these publications is a statement indicating that the publication was funded by a grant or contract or produced under contract or grant. Publications funded through grant or contract may have more than one issuing agency, the Federal agency and another publisher. Publications at the National Sea Grant Library at http://nsgd.gso.uri.edu/ are U.S. Government publications as the funding for the publications is provided by the National Oceanic and Atmospheric Administration.

Scope of National Bibliography of U.S. Government Publications

The National Bibliography includes all publications in the FDLP as well as cooperative publications and other U.S. Government publications that are for strictly administrative and/or operational purposes (e.g. forms).

The National Bibliography is a comprehensive catalog containing descriptions and locations of U.S. Government unclassified publications in all formats. The National Bibliography describes any publication, regardless of form or format that any U.S. Government agency publishes, disseminates, or makes available to the public that is of public interest or educational value, as well as any publication produced for administrative or operational purposes. Publications represented in the National Bibliography are acquired from official sources or sites, and are subject to official use or security classification restrictions.

In short, the National Bibliography is a “comprehensive index of public documents,” including “every document issued or published” not subject to official use restrictions or “not confidential in character”. Source: 44 U.S. Code §1710 http://www.access.gpo.gov/uscode/title44/chapter17_.html

Publications identified for the National Bibliography are cataloged and appear in the GPO’s Catalog of U.S. Government Publications. A new version of the catalog is currently in development at http://franklin.gpo.gov/.

It is presumed that information accessible on an agency’s public Web site is not for strictly administrative or operational purposes, classified for reasons of national security, or constrained by privacy considerations. It is also presumed that some cooperative publications may be publicly accessible online (and the issuing Federal agency recovers costs by selling the tangible format). Therefore, all publicly accessible publications on the Internet that EPA has published, disseminated, or made available to the public should be harvested.

The pages that follow provide additional clarification on what is a government publication.

------------------------------------
"And besides all that, what we need is a decentralized, distributed system of depositing electronic files to local libraries willing to host them." -- Daniel Cornwall, tipping his hat to Cato the Elder for the original quote.

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Easily link to terms in various wikis. For help, see <a href="/interwiki/3">interwiki</a>.

More information about formatting options

Syndicate content