What is your e-mail address?

My e-mail address is:

Do you have a password?

Forgot your password? Click here
close

Library develops tools to bag and tag data for large file transfers

GCN Awards One of the tools developed to assist the Library of Congress in its National Digital Information Infrastructure and Preservation Program is the BagIt specification, a file package format that allows organizations to bag and tag data for transferring large files.

The goal was simplicity and ease of use, said Martha Anderson, NDIIP program director.

“A ‘bag’ has just enough structure to safely enclose a brief descriptive ‘tag’ and a payload but does not require any knowledge of the payload's internal semantics,” according to the specification.

It was developed out of an archive ingest and handling project with NDIIP and a number of universities that simulated the transfer of 50G files across a network.

“What we learned is that we needed a kind of common container to move things around,” Anderson said. They experimented with a number of existing formats and tools, such as Zip files, but they lacked a manifest element that describes the content of the envelope, were too complicated and could not handle large enough files.

“This specification can handle arbitrary sizes of data,” she said.

BagIt is not an official standard, but “we make it the standard for any content sent to the library,” she said. “It’s proving itself to be quite useful. We are promoting it widely.”

About the Author

William Jackson is a senior writer of GCN and the author of the CyberEye column.

Reader Comments

Please post your comments here. Comments are moderated, so they may not appear immediately after submitting. We will not post comments that we consider abusive or off-topic.

Your Name:(optional)
Your Email:(optional)
Your Location:(optional)
Comment:
Please type the letters/numbers you see above
GCN Awards 2012

GCN eNewsletters

Editorial Webcasts

  • Cloud Computing: Ushering in the Next Wave of Data Center Consolidation Register Now

    In this webcast, a government IT expert will explore the top considerations, operational requirements and policy challenges inherent to integrating new and legacy applications in the cloud. You will explore the pros and cons of adopting a public vs. private cloud model based on your specific security and operational requirements, as well as how you can fully leverage your cloud investment to achieve efficiency, collaboration and transparency needs. Read more