--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to digital-curati...@googlegroups.com.
To post to this group, send email to digital-...@googlegroups.com.
Visit this group at http://groups.google.com/group/digital-curation?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
Perhaps a little old now, but you may find useful stuff on the Open Planets Foundation JP2 Working Group (now disbanded I believe):
http://wiki.opf-labs.org/display/JP2/Home
and also the Wellcome Library's blog:
http://www.jpeg2000wellcomelibrary.blogspot.co.uk/
again, last update 2011 but still has interesting things in it.
At the BL we have migrated TIFFs to JPEG2000s and I understand the reason for doing this is that the file sizes are significantly smaller but image degradation is not so bad.
As with all these things it will probably come down to your own use case - you have a load of things that can render TIFF? Keep the TIFF. You have a load of TIFFs and no significant processing power to convert to JP2 even if you wanted to? Keep the TIFF. You want to save disk space? Use JP2.
My feeling is that neither format is a particular preservation risk at the moment, but as Chris' post points out, currently support for JP2 is patchy at best - so we as a community either need to get involved with the tools, be prepared to pay or use something else!
I'm not aware of any efforts to introduce JP2 support to any of the major browsers.
--
Pete Cliff
Digital Preservation Technical Lead
The British Library
-----Original Message-----
From: digital-...@googlegroups.com on behalf of Creighton Barrett
Sent: Mon 18/03/2013 18:35
To: digital-...@googlegroups.com
Subject: Re: [digital-curation] TIFF vs. JPEG2000
Thanks Christie, much appreciated. I didn't realize use of JPEG2000 was still so low. I would love to hear from people who are using it. Is anyone aware of efforts to introduce browser-support?
On 18 March 2013 14:50, Christie Peterson <christie....@gmail.com> wrote:
A recent blog post on the Library of Congress's Digital Preservation blog did the best job I've seen of laying out issues related to JPEG2000 and preservation (read the discussion in the comments, too): http://blogs.loc.gov/digitalpreservation/2013/01/is-jpeg-2000-a-preservation-risk/
Best,
Christie Peterson
On Mon, Mar 18, 2013 at 1:04 PM, Creighton Barrett <csba...@gmail.com> wrote:
Hi everyone,
I'm curious to hear whether you are using TIFF or JPEG2000 as a preservation scan in your digitization workflows. If you switched to JPEG2000, could you shed some light on the reasoning? What has your experience been? If you considered switching but stuck with TIFF, could you explain why? Did you perform tests on image quality, storage requirements, etc.?
Any info would be much appreciated!
Cheers,
Creighton Barrett
Dalhousie University Archives
--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to digital-curati...@googlegroups.com <mailto:digital-curation%2Bunsu...@googlegroups.com> .
To post to this group, send email to digital-...@googlegroups.com.
Visit this group at http://groups.google.com/group/digital-curation?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to digital-curati...@googlegroups.com <mailto:digital-curation%2Bunsu...@googlegroups.com> .
2. I disagree with the post that JPEG2000 is not widely used. True, it
is not as used as much as other image formats directly on the Web, but
that is a chicken-and-egg issue.
3. Many, if not most major open source image programs support it, as
do many proprietary ones.
3. There are both Open Source and proprietary libraries readily
available. OpenJPEG, for instance is under active development and
version 2.0 was released just this past November and code was added to
trunk as recently as yesterday.
3. JPEG2000 is supported by a number of web browsers including Konquer
and Safari and by Quiktime within other browsers. See the bugzillia
thread linked to in the LC blog post for more details.
Just wanted to add, Oracle long ago dropped the Java Advanced Imaging API and with it JPEG2000 support in Java. In theory this was adopted as a java.net project, but the encoder/decoder has not been touched for some time.
An alternative, jj2000, is also seemingly without a home:
http://jpeg2000.epfl.ch/ (Doesn’t work)
And the fork at:
http://code.google.com/p/jj2000/
Seems just as defunct (check the issues), albeit accessible.
An active imaging library for Java, Apache Commons Imaging (http://commons.apache.org/proper/commons-imaging/) (formerly Sanselan) does not (currently) support JPEG2000 and I don’t see any feature requests to make do so.
I’m not coming to any conclusions here, but pointed out another example where major vendors are failing to see the need for JPEG2000.
Pete Cliff
Digital Preservation Technical Lead
From: digital-...@googlegroups.com [mailto:digital-...@googlegroups.com] On Behalf Of Chris Adams
Sent: 18 March 2013 23:15
To: digital-...@googlegroups.com
Subject: Re: [digital-curation] TIFF vs. JPEG2000
On Mon, Mar 18, 2013 at 5:52 PM, Edward M. Corrado <ecor...@ecorrado.us> wrote:
--
I'm curious about this: typically preservationists shy away from using compressed TIFFs, so I suspect that's part of the reason for the 1-2 orders of magnitude difference. If that's the case, I think you
would be better off using TIFF with lossless compression than JP2K.
Hello everyone, interesting thread, but no one has mentioned if is possible convert from each other, for example right now I need to convert from TIFF to JPEG2000 in Java, but I haven't been able to find any documentation about that, even in this site.. where you have talked so long about that subject. If someone has done something like that.. I would thank a lot any help..
About convert from TIFF to JPEG2000 in Java
Hello everyone, interesting thread, but no one has mentioned if is possible convert from each other, for example right now I need to convert from TIFF to JPEG2000 in Java, but I haven't been able to find any documentation about that, even in this site.. where you have talked so long about that subject. If someone has done something like that.. I would thank a lot any help..
--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to digital-curati...@googlegroups.com.
To post to this group, send email to digital-...@googlegroups.com.
Visit this group at http://groups.google.com/group/digital-curation.
For more options, visit https://groups.google.com/d/optout.
// Create the Image Plan
List<Operation> plan = new ArrayList<Operation>();
Operation op = new Operation(Operation.Type.DECODE);
op.addParameter(“inputimage”, “img0”);
op.addParameter(“outputimage”, “img0”);
plan.add(op);
op = new Operation(Operation.Type.ENCODE);
op.addParameter(“inputimage”, “img0”);
op.addParameter(“outputimage”, “img0”);
op.addParameter(“format”, “jp2”);
op.addParameter(“quality”, 100);
plan.add(op);
// Convert the image
byte[] img = // Read the input image into this byte array
ImageLibrary lib = ImageLibraryFactory.createImageLibrary();
int planId = lib.createOperationPlan(plan);
Map<String, ConvertResult> cnvResult = lib.convert(planId, img);
ConvertResult jp2 = cnvResult.get(“img0”);
// Lossless JP2 is found in byte[] in jp2.img
FileOutputStream fos = new FileOutputStream(xxxx);
fos.write(jp2.img);
fos.close();
// Cleanup
lib.deleteOperationPlan(planId);
Hope this is useful.
tc
Tom Creighton
CTO FamilySearch
You received this message because you are subscribed to a topic in the Google Groups "Digital Curation" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/digital-curation/jX1MELYvpKE/unsubscribe.
To unsubscribe from this group and all its topics, send an email to digital-curati...@googlegroups.com.
Warning: this is a _lengthy_ set of, um, musings, devoid of the helpful and specific technical notes (and even examples of command-line software instructions!) that characterize many of the TIFF-vs.-JPEG2000 comments that preceded this one. Skim-reading is advised.
I send my thanks to the many commentators--including my Library of Congress colleague Chris Adams--for this illuminating thread! I sometimes deliver a stump speech about the lack of consensus about digital preservation practices for video and motion picture film scanning. In the speech, I compare this lack of practice-maturity to the relatively more mature and settled state of digitization that employs still imaging or waveform sound formats.
In the face of this thread, however, my stump speech seems to be a bit of an overstatement. I should know better: I help out with the FADGI Still Image Working Group, and see first hand the still-evolving matter of imaging performance metrics (and related tools) and the slow implementation of fully realized color management in memory institutions. These are two more indicators of the not-yet-settled-ness of still image digitization. (I won't even mention the background topic of embedding metadata and the use of image-level identifiers. Yikes!)
Regarding JPEG 2000 in the still image realm, I echo Chris's frustration over the state of tools: it is maddening that more have not come into play, and Chris is also correct that we ought to collectively put more resources into the game, especially supporting the development of open source tools.
There is a bit of a chicken-and-egg to this, as one commentator noted: with more adoption, more tools would come along; with more tools, we'd have more adoption. Sigh. It is tempting to speculate about the underlying causes, which have nothing to do with image quality: everyone agrees that wavelet transforms (JPEG 2000) provide results that are visually superior to DCT applied to 8-pixel blocks (old JPEG). And I agree with the commentator who wrote that patents are also not a real issue.
Chris mentioned one yardstick for adoption: "Will it play in a browser?" There is a second yardstick that also counts for a lot in our age of ubiquitous digital photography and the social Web: "What formats are natively supported in the camera?" Camera specifications are the outcome of decision-making in Japan, supported by JEITA. The old JPEG format has proven to be good enough for most photographers (in contrast, the pros shoot raw) and JPEG's wide support in many applications leaves camera manufacturers in no mood to change.
A few years ago, I attended a professional photography conference at Microsoft that included strong advocacy for their then-new Windows photo format (subsequently called HD Photo and now standardized as JPEG XR). Like JPEG 2000, this Windows format was clearly technically superior to old JPEG. But the sense of the conference was that if it didn’t get into cameras as a native format, there was little hope of adoption. It is hard not to be reminded of the frustration we hear from our colleagues in professional sound recording (including Neil Young and his push for his Pono format): "How can people stand to listen to those awful MP3 files?" Yet MP3 is everywhere and it passes the "good enough" test for the masses.
It has been interesting to compare the relatively warm embrace that JPEG 2000 has received in moving image quarters with the cold shoulder we see for still images. The Digital Cinema Package (DCP) is _the_ widely adopted format (truly universal these days) for the theatrical distribution of motion pictures and it employs lossy JPEG 2000 compression. To be sure, this is a closed system in which the content creators have dictated the specification to theater operators, and projector manufacturers are required to have their systems tested for conformance. But the very high level of clarity in the lossy JPEG 2000 imagery--thanks again to wavelet transforms--was very important to the motion picture industry. The picture has to look good on the screen; in comparison, Internet-delivered MPEG is still pretty awful. For theaters, JPEG 2000 is unshakably in place.
We also see the adoption and use of JPEG 2000 elsewhere in professional moving image circles, ranging from digital moving image cameras to post-processing systems. Beginning in 2007, the Library of Congress began reformatting its holdings of older video recordings into a format that employs lossless JPEG 2000 compression, and we now have tens of thousands of digital video master recordings in this format, petabytes-worth. My sense is that there is a bit of open source development to support JPEG 200 with moving images but, as noted, the adoption has mostly been among professional users, and these are folks who are very accustomed to purchasing commercial products.
There are a couple of other considerations to throw into this idea marketplace. One came up at our last FADGI still image meeting and was echoed in the flow of comments in this Digital Curation thread. At the FADGI meeting, we heard from a representative from a federal agency that partners with commercial entities. One of their partners scans microfilmed census and other historical records using an all-JPEG-2000 workflow, producing millions of images every year, which are then given to the agency. I don't remember the details, but my impression is that the commercial partner produces JPEG 2000 master images with a modest level of lossy compression. That would be easy to believe: wavelet transforms in modest-lossy compression will retain the legibility even of handwriting. Some folks even argue that a lot of what is discarded in modest-lossy JPEG 2000 compression is noise, thereby (according to this account) improving the image quality. In any case, for these forms of content, neither the originals nor the microfilm masters are being discarded, so the anxieties about authenticity come down a notch.
Meanwhile, I have been told that one or more of the so-called book mass-digitization projects also use lossy-JPEG-2000 workflows. The critiques of those projects tend not to complain about technical image quality but rather about "housekeeping": missing pages, pages out of sequence, thumbs or fingers obscuring text, or blurring caused by a page flipping or moving when being scanned. When the thumbs are out of the shot, the JPEG 2000 imagery delivers a perfectly readable typographic page. And some presentation applications take advantage of JPEG 2000's tiling options to support zooming in the browser, as the Library does for, say, its map collections. (This happens under the covers since, as has been noted, JPEG 2000 imagery cannot be directly displayed in browsers.)
At our FADGI meeting, our colleague's remarks did not concern JPEG 2000 as an output for the agency's own image production; rather, the question had to do with the receipt and preservation of JPEG 2000 images delivered to the agency under the terms of the partner agreement, images that the agency can make publicly available in a few years time. This circumstance--JPEG 2000 images delivered by a partner--also arises in some libraries as a result of their partnering in mass book-digitization partnerships. The point is that some organizations have, ipso facto, large bodies of JPEG 2000 imagery that they need to be able to manage over the medium or long term. This is a just another reminder that many archives must support the long-term management of more than one digital format.
The topic of when and where to use JPEG 2000 at the Library of Congress for still image digitization comes up from time to time in informal conversations. (As noted, this is pretty much a settled matter for moving image materials.) Alas, our informal conversations have thus far produced little more consensus than what is manifest in this thread.
These informal conversations sometimes conflate a pair of topics, the second one being whether, where, and when to accept lossy compression for master files. Some have looked at classes or categories of material, for example (a) catalog cards, (b) widely held twentieth-century printed matter (think "congressional documents"), (c) maps, (d) manuscripts, and (e) photographs (including negatives). Reading from left to right, one can make the argument that the imaging stakes go up in a progressive manner. Might lossy compression be acceptable for the first one or two in the spectrum, and not for the others? The Library carried out a bit of an exploration of the lossy-vs-lossless question in the mid-1990s but the findings did not take root (http://memory.loc.gov/ammem/pictel/). That twenty-five-year-old project compared old JPEG to uncompressed images and, for certain classes of manuscript, lossy seemed acceptable. But the context was different. For example, we were then shy about color--we remembered that monochrome had always been accepted for preservation microfilms and we worried about big files. Today, I think we would likely embrace color for most manuscripts and, if we still accepted lossy, we would be glad to see the added clarity provided by JPEG 2000.
At the risk of forking this thread into a tangent, I would like to ask for experience narratives about the use of PNG, a format that Chris mentioned. For several years, due to inattentiveness on my part, I had relegated PNG to the use case of access-via-browser. This was due to the fact that the format was initially created as a reaction to the threats of licensing fees for GIF (another good-for-browsers format) in the 1990s. Last month, however, I re-read the W3C specification for PNG and found lots of nifty features, on paper at least. For example, there are specs for things that provide helpful support for color management, including a group of tagged color-space metadata elements document things like primary chromaticities and white point, image gamma, and carry an embedded ICC profile. In addition, as has been reported in this thread, PNG offers lossless compression with good results (albeit apparently a bit less efficient than JPEG 2000). Do any libraries or archives that use PNG as a mastering file? Have people found that tools support some of the features that caught my eye, like the ones that support color management?
Whew! Thanks for your patience with this long posting.
Carl Fleischhauer
--
"Anyone got some hi-res scans of newspapers?"