ePub3 Made More Meaningless

23 March 2014

ePub3, IDPF, Specification, Test Books, Reading Systems

epubtest.org is up and working. We covered this in September last year here, and checked out AZARDI here. The ePubTest site looks quite nice and I do like the way it has addressed the complexity of giving an analysis for a lot of reading systems and properties. Regrettably there is no business or market prioritization or weightage of the various features.

Mandatory and optional features are listed separately, but in the final scoring everything is bundled together. That means dinky optional features get the same weightage as essential features such as navigation. It also means meaningless mandatory features such as CFI get the same weightage as essential content features.

The outcome is that the "scoring system" does not represent anything even slightly useful for users, publishers... or anyone... in terms of understanding the value and usability of any particular reading system.
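
To make the bundling problem concrete, here is a minimal sketch of flat scoring versus priority-weighted scoring. Everything in it is an assumption for illustration: the two reading systems, their pass/fail results and the weights are invented, and this is not how epubtest.org actually computes its numbers.

```python
# Illustrative only: readers, results and weights are hypothetical,
# not data from epubtest.org or the AAP.

# Hypothetical priority weights (higher = matters more to publishers).
WEIGHTS = {
    "navigation": 10,
    "embedded_fonts": 6,
    "media_overlays": 5,
    "cfi": 1,
    "triggers": 1,
    "switch": 1,
}

# Hypothetical pass/fail results for two imaginary reading systems.
RESULTS = {
    "Reader A": {"navigation": False, "embedded_fonts": False,
                 "media_overlays": True, "cfi": True,
                 "triggers": True, "switch": True},
    "Reader B": {"navigation": True, "embedded_fonts": True,
                 "media_overlays": True, "cfi": False,
                 "triggers": False, "switch": False},
}


def flat_score(passed):
    """Every feature counts the same: percentage of features passed."""
    return 100 * sum(passed.values()) / len(passed)


def weighted_score(passed, weights):
    """Each feature counts in proportion to its priority weight."""
    earned = sum(weights[name] for name, ok in passed.items() if ok)
    return 100 * earned / sum(weights.values())


for reader, passed in RESULTS.items():
    print(reader, round(flat_score(passed), 1),
          round(weighted_score(passed, WEIGHTS), 1))
```

With these made-up numbers the flat score ranks Reader A (no navigation, no fonts) above Reader B, while the weighted score reverses the order. The exact weights are debatable; the point is only that a flat tally can hide the absence of the features publishers care about most.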

Not all IDPF specification features are equal in importance or value. That is an industry-stated fact. This was made very clear with the AAP ePub3 Implementation Project, which saw publishing industry inputs on prioritized feature importance. The AAP got it right, but regrettably none of this industry feedback or experience has any effect on the IDPF.

This was an opportunity for the IDPF to synchronize with the publishing industry's stated priorities and requirements rather than giving everything the same mark weightage in nine relatively arbitrary feature sections. The evaluation sections were based on the written structure of the extremely dated 2011 specification rather than on the publisher needs stated in 2013.

Arbitrary Graphic Comparing Various Book Packages with HTML5

Here is an illustrative graph of various e-reader technologies and how they compare with (X)HTML(5).

Statements have been made that ePub3 is HTML5. That is a deliberate obfuscation of the facts. The core packaging and add-on technologies have nothing to do with HTML5.

Of course ePub3 makes a passing nod to XHTML5 (the content pages are a custom set of XHTML); but ePub3 has so many irrelevant XML, XLINK and arbitrary property add-ons that any direct comparison between HTML5 and ePub3 is specious.

Meanwhile E-Book Zero builds exactly on HTML5 with controlled structures and vocabulary and achieves so much more.

For example, giving vertical Japanese and other backward-looking reading system support features the same mark weightage pushes a reading system's ranking up even if it has no significant ePub3 navigation support (the most important AAP feature requirement).

The "score" is an incomprehensible IDPF tab against the pointless features of ePub3 that has no particular meaning for publishers trying to implement real digital content strategies.

The AAP ePub3 Priority Features

Navigation, both via thorough TOCs and proper document structure
Audio, needed for media overlays, aligning with audiobooks, and accessibility
SVG, an XML-based format for scalable images with searchable text
Fonts, including embedded fonts for design and special characters
Media Overlays, which enable synchronization of text with recorded audio
Semantic Inflection, which provides important information about an element’s purpose and aids contextual search and assistive technologies
MathML, the XML format for math, providing resizable rendering and accessibility
Video, including proper use and handling of fallbacks in systems without video
Floats, the CSS function for handling elements outside the narrative text flow
Fixed Layout, which enables control and display of fixed pagination in EPUB

The IDPF assessment has nine feature "categories" and all categories are given an equal weighting. That simply doesn't make sense. But even worse, within the categories the choice of important items is driven by technical considerations rather than publishing needs.

Navigation. This is of course the most important feature of all and along with the core packaging defines the ePub3 as something separate from a collection of XHTML5 pages. It covers TOC, Landmarks, custom navigation and spine rendering. We certainly put the maximum effort into navigation when developing AZARDI with matching navigation features in IGP:Digital Publisher as can be seen in the Document Processing Instructions interface. That is because our target was education content. How you move through education content is a very important engagement exercise.
Content Documents. This collection of test cases contains a lot of stuff dumped in together that seriously needs to have a little discernment applied. This could have been richly useful if it had been divided into publisher content concerns as at least four separate books: Text presentation, Images & Media, SVG and MathML. That would have given very strong presentation benefits. It also includes the weirdo epub:triggers, even weirder SVG/Images in spine and that 2005 epub:switch thing.
Styling. This is a relatively arbitrary list of styles with a nod towards list numbering in various languages as an interesting test case quirk.
Scripting. The specification is of course hopelessly out of alignment with market requirements in this area.
Media Overlays. This is read aloud with text highlighting stuff. It is useful for a lot of types of content.
Font Embedding and Obfuscation. This will be important especially for trade books with typesetting-style approaches, multi-language content and specialist education. It was marked by the AAP as important. We published an interesting set of articles on fonts in ePub3 some time ago.
Global Language Support. If only the test cases were global! This should be called Japanese vertical writing and rubytext support and NOT be lumped under this misguided heading. Check out Around the World in 28 Languages for a more comprehensive test case of international languages. This was published in 2011. The ePub3 set of test cases affects only Japanese, Arabic and Hebrew books but it gets 11% of the mark weightage. Weird stuff.
Content Fragment Identifiers. This is a pointless required feature built on failed X-Links. It should be optional. It should be a separate category but because it is never going to be used should have a mark weightage of 2% maximum.
Fixed Layout. The fixed layout test cases are a painful mess. Yes they give an evaluation with the Javascript, but that is not matched with the quality of the presentation. It is difficult to see how these can reasonably be used at present.

The most amusing of these is "international language support". There are in fact just two issues and three languages: vertical Japanese writing, and right-to-left presentation for Arabic and Hebrew. The IDPF redefines International!

In the scoring system vertical Japanese is given 80% of the Global Language Support marks. We understand this is pushed by the "Manga" brigade and is important in Japan, and that at the time of authoring Sony was strongly supporting ePub3. However this category should NOT have the same mark weightage as Navigation. Looking at languages that are actually spoken by reasonable numbers of people, this should be something like 1%. This type of feature mis-weighting just lowers the value of everything. There is absolutely no reason vertical Japanese should be a required feature if a reading system is not being promoted for use in Japan.
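
The arithmetic behind that complaint can be sketched as follows. It assumes, as described above, nine equally weighted categories and an 80% share for vertical Japanese inside Global Language Support; the real epubtest.org calculation may differ in detail, so treat the result as a rough figure.

```python
from fractions import Fraction

# Assumptions taken from the text above, not from the epubtest.org source:
CATEGORIES = 9                           # nine equally weighted categories
VERTICAL_JAPANESE = Fraction(80, 100)    # share within Global Language Support

# Share of the total score carried by any single category.
category_share = Fraction(1, CATEGORIES)

# Share of the total score effectively carried by vertical Japanese alone.
vertical_share = VERTICAL_JAPANESE * category_share


def as_percent(share):
    """Convert a fraction of the total score to a percentage, one decimal."""
    return round(float(share) * 100, 1)


print(as_percent(category_share))   # 11.1
print(as_percent(vertical_share))   # 8.9
```

Under those assumptions vertical Japanese on its own carries close to 9% of the total score, against the 1% or so argued for here.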

There is a pointlessness in commenting on the IDPF ePub3 specification and "initiatives". But if you care about delivering content, especially education content, it is important to stay engaged.

With ePub3 the IDPF pushed more wrong buttons than right buttons. They were of course victims of radical market changes: the death of the autonomous reading device and the emergence of tablets and large smartphones. With AZARDI we have been highly selective and rationalized the features supported. The focus, like the AAP's, is on the sensible parts of the specification that are HTML5 and forward looking. Anyone who says they are supporting the full ePub3 specification is not serious about digital publishing content. They are distraught technophobes.

The ePub3 spec pretty much encapsulates the state of e-reading systems before the advent of tablets and smartphones. The timing for irrelevance was perfect.

Summary

The value of ePub3 for publishers is a reasonably competent and consistently defined packaging and navigation structure. The rest of the specification is lost in a pointless 2007 reading device hole.

It would have been good if more effort had gone into testing the test cases to ensure they were all correct. However the overall quality is higher and more relevant than the notional ePub3 test books on the IDPF site. So that is a step forward.
It would have been good to see alignment between the publishing industry stated priority features and the point system. This would have been a significant industry building step. For example the AAP list of 10 could have been used as the primary features, and the IDPF members' idiomatic features tagged on the bottom.
The marking system is opaque (to be kind) to the point of uselessness. It provides no tools or intelligence on reading system usefulness or actual ePub3 specification usefulness.
The fact that the IDPF made an effort on these test cases only after nearly three years probably reflects the importance of this to the publishing industry and of course the various IDPF members. ePub3 is such an also-ran in the digital content world of 2014. Where is the credibility? Certainly not in the actions of the IDPF.

I see articles about Apps vs. e-books from time to time such as this article "Five Myths About Book Apps" by Karen Robertson. Meanwhile AZARDI in ePub3 and E0 modes can show any content in an App-like manner as it has scripting, SVG, MathML, fixed, scrolling and reflowing layout. That is because we decided to select "the" useful and sensible features from the arcane ePub3 specification, ignore the un-needed features, step around the un-wanted features and get down to business way back in 2011.

You can download the free Desktop version of AZARDI here. Take a look at some of our sample ePub3 books and E-Book Zero books to see just what can be done when you eliminate annoying print-book trim.

AZARDI ticks off the AAP priority list and of course adds real Internationalization support. We are now working on 206 languages as mentioned in the previous post. This is an exciting new dimension to accessibility that digital content delivers. Of course that means fonts are very important.

So I guess now we just have to wait for reading systems based on the IDPF Readium API project now underway and see if that makes any difference for anyone.

Posted by Richard Pipe