Jeff Duntemann's Contrapositive Diary Rotating Header Image

ebooks

The Pulps Reconsidered, Part 4

BasketballStoriesCover350Wide.jpgThe essential difference between literary (as we define it today) and non-literary fiction didn’t crystallize for me until first-person shooters happened. I’m not one for games in general, but an hour or two playing early shooter games like Doom and Quake back in the 90s was an epiphany: This is a species of fiction. The following years proved me right. Most ambitious action games have at least a backstory of some kind, and some modern MMORPG systems have whole paperback novels distilled from them. (See Tony Gonzales’ EVE: The Empyrean Age, based on EVE Online.)

Of course it’s not literature. Did anybody say it was?

What it is is something else, something important: immersive. You get into a good game, and you’re there. I can do the same thing with a decent SF novel, but the phenomenon is in no way limited to SF. I’m guessing that Farmville or almost any reasonably detailed simulation works the same way.

Immersivity is the continental divide between literary fiction and pulp fiction. Like anything else in the human sphere it’s a spectrum, placing World of Warcraft on one end and Finnegan’s Wake on the other, with everything else falling somewhere in the middle. The term measures the degree to which you can lose yourself in a work, where “lose yourself” means “forget that you’re reading/playing and enter into experiential flow.”

Don’t apply a value scale to immersivity. It’s only one dimension of many to be found in fiction, and my point here isn’t to dump on Finnegan’s Wake. Literature is intended to evoke a response in the reader, but that response is not necessarily immersion. (It can be, particularly with classics like Huckleberry Finn that are new enough to be culturally familiar to us–dare you to read Chaucer without footnotes!–and yet not so new as to be afraid of Virginia Woolf.)

Pulling the reader in and carrying him/her along requires a smooth, linear narrative style, a vivid setting, and enough going on to maintain the reader’s interest after a long day working a crappy job. Pulp characters are often types, but that’s not necessarily due to a lack of skill on the writer’s part. A carefully chosen and well-written type allows room for a reader to imagine being that character, which is important in immersive fiction. As much as I enjoyed Gene Wolfe’s massive Book of the New Sun (and I’ve read it three times since its publication) I had a very hard time imagining myself as Severian. I empathize with him and certainly enjoyed watching him against the dazzling surreality of Urth (though I had to read numerous sections several times to be sure I knew what was going on) but being him? No chance. Keith Laumer’s Retief, on the other hand, no problem. Louis Wu? Same deal.

And for the umptieth time: (I can hear the knives being sharpened) This is not to denigrate literary fiction, of which I’ve read a lot and still do. My point is that immersive fiction is a valid entertainment medium, requiring different mechanisms and different skills than literary fiction. Let’s not dump on things for simply being easy to read. Easy is good if easy is what you want–and (on the author side) if easy is what people are willing to pay for.

Which should not suggest that easy to read is necessarily easy to do. The immersive magic of the pulps is obscured by the fact that a lot of it was just badly done, and could not have been otherwise, given that some pulp titles paid a quarter cent a word and published eighty thousand words twice a month. We can do much better these days, at least on the quality side. A brilliant potboiler is eminently possible–if we as readers give authors some sense that it’s ok to take up the challenge, and that they’ll be paid for their efforts when they succeed.

More in this series as time allows.

The Pulps Reconsidered, Part 3

SDMCover.jpg

Bet you thought I forgot about this series, huh? Not so: I needed a little time to take a broader look at the field. (Click here for Part 1 and Part 2.) Someone told me that a lot of 1930s/40s/50s pulps were being scanned and posted on Usenet at alt.binaries.pictures.vintage.magazines, so I went up there and pulled down a representative sample. And I’m not talking SF anymore; what I grabbed were things like Air Wonder Stories, Mammoth Western, Strange Detective Mysteries, Adventure, and Spicy Stories.

It’s been wonderful fun. In fact, it’s a lot like watching campy old b/w TV shows, only better, because I can decide how everybody and everything looks. I don’t have to be appalled (or giggle) at the cheap crappy special effects. I just willingly enter a world in which nobody rolls their eyes at a homicidal supermarket butcher about to strangle a square-jawed hero armed with a pistol in one hand and a hacksaw in the other. (See above. No, I didn’t read that story. I still wonder what the hacksaw was about.)

I’m not the first to suggest that the pulps vanished largely because TV took over their niche. The pulps were Saturday-morning movie serials that you could enjoy any time you wanted, and once TV started showing Commando Cody, Bomba the Jungle Boy, Flash Gordon, and made-for-TV adventures like The Texas Rangers, Sky King and Highway Patrol, much of the money went out of pulp publishing. The financial pressure was eventually fatal, but over the short term, as the pulps dwindled, their quality went up. And it wasn’t just that we knew more about science and technology and hence could write better SF. The SF of the Thirties was awful because the readership didn’t care. The pulps had a monopoly on cheap entertainment and people bought it because it was all there was, and reading it was better than staring at the wall.

Print entertainment evolved out of the pulps and into other print markets, particularly glossy mags. The railroad pulps died, but glossy, ad-supported magazines like Trains and Railroad picked up the readership, which after WWII had more money to spend on locomotive picture books and model railroading. Tacky text-porn mags like Spicy Stories (which had racy drawings and a handful of “artful” b/w nude photos) gave way to Playboy and its cheaper imitators as social strictures against visual porn weakened in the 50s. In the late 50s, pulp SF improved hugely, and bootstrapped itself into the new world of mass-market paperbacks by selling reprint anthologies of the best work to come out of the pulp era. (We can be fooled into thinking 30s and 40s pulp SF was better than it was because what we read of it was hand-picked for quality decades after its publication. Read a couple of original SF pulps circa 1935 and you’ll see what I mean.) Crime pulps went both up and down, to comics on the low end (much of “crime” fiction from the Depression was actually horror) and to book-length mysteries on the high end. The romance pulps like My Romance split similarly into gossip mags and mass-market romance novels.

Fewer people may be reading these days, but those who gave it up probably never liked reading that much to begin with. Again, reading was better than staring at the wall, but TV, when it arrived, was easier, especially for people with marginal education. The audience that remained was pickier, and many had been formally exposed at the college level to classic literature, which became the standard by which all fiction was measured.

And that may be a mistake. (I’ll come back to this point in a future entry.)

Leaving the quality of the writing itself as a separate issue, after a good long look around I’d say that the lessons of the pulps are these:

  • The pulps were about specific cultures. They were tightly linked to a time and a place and a generally understood cultural subtext. This was even true of early pulp SF, much of which might be characterized as “Depression-era Chicago on Mars.”
  • Characters were intended as costumes to be worn by readers, not fully realized individuals to be admired on their own merits as independent men and women. A lot of people don’t understand this, and many still won’t admit it. Make characters too vividly fluky and original, and readers will have a hard time identifying with them.
  • As a corollary to the above: Concepts, settings, and action were as important as characters, and much more vivid. Again, it’s the difference between imagining yourself driving a fast car and imagining someone else driving it.
  • The pulps were fun. They understood and accepted their role as immersive entertainment. They were not equipped to be literature and didn’t try to be literature.

With all that in mind, the big questions become: Is there unmet demand today for good-quality immersive (non-literary) fiction? How much of this legacy can we retrieve in 2010 and do well?

More next time.

Daywander

We’re at a dog show in rural Greeley, Colorado, a little north of Denver–and right smack dab next to a huge cattle feedlot. Now, I’m a caveman and a realist–manure is the price we pay for beef–but that stuff sure do stack up and make its presence known. We kennelled QBit and Aero to simplify show logistics, but it’s funny not having Aero with us at a show. He’s a champ now, and the spotlight has shifted to Dash, who at 15 months already has 11 points (of the required 15) and one major win (of the required 2) toward his own championship. Aero was always a shy dog, and fearful at the outset. This cost him points early on, but Dash has never had any such problems, and it’s (remotely) possible that he could score a big enough win this weekend to make him a champion while still technically a puppy. (He won’t reach his majority until 18 months.) Master groomer Jimi Henton will be helping Carol make him and Jack look their best, and we have high hopes.

I just finished a dozen Lucerne eggs and I’m still alive to write about it, so the big contaminated egg thing may not be as horrible as some are making it out to be. But half a billion eggs, sheesh–and that from one company. Am I going to give up eggs? Hardly. My sole gripe is that Lucerne’s are the only eggs I can find locally in Medium, and I’ve titrated myself to a pair of Medium eggs scrambled for breakfast as what best carries me until lunchtime without any energy lapses. So I may have to fall back to a single Extra Large until I can scare up a different brand of Medium eggs. And while eggs are on the table here, does anybody see size Small or Peewee eggs sold at retail? (As best I know, these are generally sold to food producers for cakes and such.)

I bought and have been tormenting a new-ish WYSIWYG EPub editor product called Jutoh, from the guy who gave us the free ECub editor. It’s available for Windows, Linux, and Mac, and I’m testing it under Windows and Ubuntu. My first impressions are generally good, though the product still has a couple of thin spots, foremost of which is an inability to import .DOC files. More on it once I have a chance to get a couple of projects through it.

It may sound odd, but I’m pleased that Jutoh is not free software. It’s only $22, which is trivial–I’ve spent more than that just having an indifferent lunch with Carol at Village Inn. I don’t want to see the category of inexpensive commercial software die out. For a long time it seemed that software was going to cost either zero or a thousand dollars, which would mean that few solo software geeks would attempt to field an innovative utility that would not sell for hundreds but might sell for tens. I bought Atlantis some time back for $45 and love it–it generates the best EPubs of anything I’ve tried so far. (Jutoh is still in the running, but the race has barely begun.) Free software can be superb but all too often evolves slowly, if at all. Zoundry Raven, on which I write this, hasn’t been updated since 2008…though I must balance this by citing the free ebook manager Calibre, which is updated every couple of weeks.

We have lost the Star Hustler. Jack Horkheimer has gone off to see what the stars look like from the other side, and as little as I saw of him (I’ve not watched much TV in the last 40 years) I will say that he did a spectacular job making observational astronomy compelling to ordinary people, especially young people–and as goofy as he seemed sometimes, he never made me want to kick his teeth in, as all too often happens with Bill Nye. Science should not be full of itself (nor, alas, full of something else, as is the case far too often) and Jack was not. Keep looking up…maybe you’ll spot the light of the Big Bang glinting off the top of his head.

Odd Lots

  • Everyone’s talking about a recent Copyright Office ruling that jailbreaking of smartphones is no longer illegal, but few have mentioned that several other significant exceptions to the DMCA’s anticircumvention provisions have been issued in the same ruling. Most interesting to me are limitations on ebook DRM where they prevent audio interpretation of texts from working.
  • Could Popular Electronics be returning? Let us pray. (And thanks to Don Lancaster for the link.)
  • Carol and I have begun avoiding movies in 3D. They give her headaches and they make me seasick. I thought it was just us being weird, but there’s some evidence that 3D isn’t the crowdpleaser that everybody (especially in Hollywood) thinks it is. Here’s some explanation.
  • And even the 2D movies we’ve seen recently seem excessively loud. We may not be imagining things.
  • A new dual-core Android-based tablet by an otherwise unknown German firm is really calling to me. We may not see this one here for awhile (if ever) but if it’s evidence of an evolutionary explosion in Android tablets, I’m good with that. Ours will arrive eventually.
  • I’ve always been taken aback by the near-psychotic venom with which certain people treat an informal, likeable little font called Comic Sans. Scan the Internet and you’ll get a sense for what I mean. From ten steps back it looks like a tribal identity thing: You must slander Comic Sans to prove that you’re a member of the tribe, especially if you’re insecure about your membership. Secure people just keep their mouths shut and use something else.
  • The little red guy running with a hatchet (see my entry for June 27, 2010) appears to be the logo of Psychopathic Records, not the Insane Clown Posse band itself, granting that the label was founded by the Insane Clowns and is probably owned by them. (Thanks to Ricky C on LiveJournal for the tipoff.)
  • I solved another band logo question with the help of Google’s new output format for their Images search. Carol and I saw a band logo that resembled a bright red ballet dancer, apparently headless. I typed “red dancer band logo” into Images and there it was, an emblem of the Dave Matthews Band. I’m starting to like the new Google Images search output because it allows me to scan more images at once, rather than page repeatedly through a more limited matrix. This isn’t always useful, but I’m guessing it’s useful more often than not.
  • Bicyclists in NYC seem to be preparing early for the coming Ice Age.

Odd Lots

Review: The Calibre Ebook Management System

I tried Calibre when it first came out a little over two years ago (v0.4.83) and was reasonably impressed. It did everything it said it did, reliably and without much fuss. Alas, I didn’t test most of its features back then, especially its file conversion modules. I’ve done a lot more in the past week, and overall I’m pleased.

The current version is 0.7.6, and author Kovid Goyal posts updated releases frequently, as often every couple of weeks. That’s amazing for a GPLed app, but Calibre itself is amazing in its way. If you install no other ebook reader or manager, get Calibre. It’s a Python app, and can be downloaded for Windows, Linux, or Mac.

There are three general aspects to Calibre:

  • It’s a sort of jukebox for ebooks: a simple database manager that allows you to browse your ebook collection, search for individual titles, and edit metadata by individual title or in bulk. It can send books to any of a growing list of hardware readers.
  • It’s a collection of import/export modules behind a GUI, allowing you to take an unencumbered ebook in one of a long list of formats, and export it to a different format out of that same long list.
  • It’s an ebook viewer that can render ebooks for reading in most popular formats. When a format isn’t supported, Calibre attempts to launch the associated app to render the book.

All three aspects work well, though I ran into some problems with format conversion. I tested Calibre by importing basically every ebook I have on disk, which at this point isn’t all that many. I still don’t have a portable reader device that I like, and I don’t read a lot on my PC display. So I went and got a bunch of things from Project Gutenberg (including all the pre-1923 Tom Swift Senior books) plus some religion journals and other PD oddments from Google Books, and ended up with about 150 titles.

Calibre copies imported ebooks from their original locations to a separate directory, and it operates only on those copies, leaving the originals alone. (This means that the space your library takes on disk will basically double, though I doubt that this is an issue in an era of 2 TB hard drives.) It controls the filename of each file, and imposes a filename by running a regular expression against the title and author name in its database. Change a book’s title in the database, and the filename changes in sync. Delete a book, and only the imported copy in the Calibre directory goes away. Your originals are not touched.

Once you import the ebooks you own, plan on spending some time editing the metadata. Calibre uses a regular expression to extract an author and title string from each file, and although you can change the regular expression if you want, there’s no broadly accepted standard for ebook filenames, and you’ll find that many of your books have the author name in the title field or vise versa irrespective of the expression Calibre uses. You can specify a series name and number for books in series; e.g., Tom Swift, Sr., Volume 12. There are additional fields for publisher, ISBN, pub date, and comments, and if a cover image is present in a book, a thumbnail will be displayed. There is a tagging system with a tag manager.

Sorting out the metadata was a fair bit of manual labor, even for only 150 books. You can do updates on several books at once; for example, I highlighted all the Tom Swift books and set the Author field to Victor Appleton in one operation. If you have many hundreds or perhaps thousands of ebooks (and I know people who do) good luck; you’ll need it. There is autocomplete on fields and that helps, but there’s an irreduceable amount of keystroking that has to happen to get the most from the database browser.

The ebook viewer is as good as I’ve tested so far. It renders almost every ebook format I’ve ever heard of, including the comic book formats and PDF. (You can configure it to launch an external app to handle a specific format if you choose; for example, I open CBZ and CBR files with Comical.) For EPub and MOBI files, at least, the reader automatically maintains a bookmark to the last opened location in the book, and when you reopen a book, the cursor goes right to that bookmark. (This is not true for LIT, PDB, , and LRF books.)

About the conversion modules I have mixed feelings, and the problems are probably not all with Calibre. I converted my EPub version of the Beyschlag Old Catholic history to LRF, MOBI, and PDB. Results were so-so. One problem with the LRF export was that the font size was inconsistent: Parts of the text were rendered in larger type than others, and I can’t tell (yet) if that’s an issue with Calibre’s LRF viewer module or with the conversion process from EPub to LRF. The conversion to PDB stripped out all the formatting, including italics, and that does appear to be a problem with Calibre. MOBI kept the italics but didn’t center the author lines. Calibre seems happiest dealing with EPubs, and conversion from other formats to EPub works better.

Note that Calibre doesn’t deal with DRM-encumbered files at all. That’s fine with me, as I won’t buy DRM, but you need to keep it in mind if you’re looking to read DRMed books on your PC; Calibre is not the item for that.

I also installed Calibre under Linux, and I moved my entire Calibre database over to the Linux machine by simply copying the Calibre books directory to a thumb drive, and then copying the directory from the thumb drive to a folder in my home directory and telling Calibre to use it. As best I could tell, there were no functional or performance differences between the Windows and Linux versions.

There isn’t a lot of downside to Calibre. Opening and rendering an ebook on the internal reader can be slow if it’s one of the more sophisticated formats. (Txt and .rtf files open very quickly.) The viewer doesn’t downsample cover images very well when displayed at less than their native resolution, though that’s a quibble. (Reduce the display size on my Old Catholic history epub and you’ll see what I mean.) Adding bookmarks seems to take more time than it should, especially on longer books. The program crashed once when I had a lot of windows open. (These included Thunderbird 3, which seems to be causing a lot of weirdness recently.)

Calibre doesn’t help you create ebooks; that’s not what it’s for. And some issues with the conversion modules are going to keep me looking for reliable ways to make MOBIs, LRFs, and PDBs out of my EPubs. However, in terms of an ebook manager, it’s just short of stellar. The viewer modules work reasonably well, particularly with files created “natively”–that is, not converted from one format to another.

Basically, the ebook business is still mighty young, and I’m not surprised at how random things still are. Among ebook-related software products, Calibre is the least random of anything I’ve yet tested, and at this crazy stage of the game, that’s high praise.

Highly recommended.

Daywander

newfawn.jpgIt’s fawn season again, and yesterday we saw a mother deer leading a fawn that was no bigger than Jackie, if perhaps a little taller. Figure that: A deer the size of a bichon. The poor thing can’t be more than a day or two old, and it’s wobbling unsteadily around the First Curve on Stanwell St., where teenagers roar by in their parents’ elephantine Escalades and probably wouldn’t even notice if they had small animals wedged in their grilles. (We’re mostly thankful that they don’t miss the curve and plow through my office window.) Last night about 8 or so, mom had gone off somewhere, and junior was simply lying on our neighbor’s mulch, about six feet from the pavement. It wasn’t as obvious as it could be, but there are much better hiding places in the area. I guess we can think of it as evolution in action.

BrianCarolRingDazzle.jpg

Our nephew Brian was out for a few days last week, and we all went down to the Garden of the Gods for a vigorous walk around the rock formations. I took a photo of Brian and Carol and something very weird happened: A dazzle from one of Carol’s rings just happened to hit the camera the moment the shutter snapped. Green Lantern must have that problem a lot, but this is the first time I’ve seen it from Carol.

My low-key inverted-vee antenna should be up and running off the back deck by Field Day, and will be 32 feet on each leg. That will get me the 20 meter band and up, and given that I’m feeding it with a short run (~10′) of open wire line through an MFJ Versa Tuner 2, I may get 40 as well. I’ll certainly try.

I’m still testing EPub readers. This morning, at Jim Strickland’s noodging, I installed the Barnes & Noble Desktop Reader. Not a bad item, but as with all the readers I’ve tested so far, doesn’t quite get it right. The presentation on my test files has been pretty good so far, but this time the software does use the title tag, and thus puts up only half of the Beyschlag ebook’s title. Also, it puts my test books up in two-column format, and I still haven’t figured out how to control the column settings. Neither of my two test books with cover images show their covers as thumbnails in the library pane. To its credit, the reader renders PDF documents pretty well, though of course there’s no metadata and thus no display of title or author.

Most annoying is the User Guide button, which brings up a longwinded sales pitch but no user guide. I assume you have to sign up for a B&N account to get the user guide, and I will at some point, but probably not today. I do understand that the product is designed to work tightly with B&N’s online bookstore and won’t slam them too hard for that integration, but basic “here’s how to do it” information should be there long before the sales pitches begin.

Nobody’s perfect, but the winner so far is FBReader, even though it inexplicably displays my copyright notices in ancient Greek. We’ll get there. Just not as soon as I’d like.

Coding vs. Compiling EPubs

It’s always unsettling to admit that the other side has a point, but it’s good practice and often absolutely necessary. I am the VDM guy, after all, and I’ve never been one for hand-coding what can be generated automatically. As I’ve mentioned here earlier, an awful lot of people take their text and hand-code an EPub framework around it to create an ebook, which I found borderline ridiculous…until this morning. Now I think I know why they do it.

It’s simple: Our EPub compilers have a very long way to go.

The process of creating EPub-formatted ebooks can be done two ways: Write your own XML/XHTML by hand, or let a utility of some sort generate it for you. I’ve done both in recent days, and I was bowled over by the conceptual similarities between that and the gulf between writing a program entirely in assembly and writing it in an HLL like C. I’ve done a fair bit of tracing through assembly code as compiled by GCC, and I’ve been very impressed by the cleanness and comprehensibility of the assembly files it produces. GCC is one helluva compiler, as is the Delphi compiler. (And that’s where my low-level code tracing experience begins and ends, mostly.)

Well, I’ve been spoiled. Compared to GCC (or even Delphi, which is now 15 years old, egad) the EPub format is a babe in diapers: poorly understood, still growing furiously, and, as often as not, smelly as hell. All of that will pass. (I remember my nephew Brian in his diapered era; he is now 27 and an investment banker.) But in the meantime, well, the immaturity of the EPub technology must be dealt with.

I did another, larger test case EPub yesterday. I took a 15,000-word article from an old theology journal, extracted the text via ABBYY PDF Transformer, cleaned up the text (which was in fact pretty damned clean to begin with; ABBYY does a superb job here) and loaded the text into the Atlantis word processor. Without a great deal of additional editing, I exported it to an EPub file. That file may be downloaded here. (40K EPub.) There are no images, and all the text exists in a single XHTML section. It’s about as simple structurally as an EPub can get, and what you see is just as it came out of Atlantis. I did not tweak it at all post-Atlantis, neither manually nor in Sigil. (Note well that Atlantis can export EPub, but it cannot import EPub files, nor display/edit EPub XML/XHTML.) I then took that file and loaded it into Sigil, added a cover image, and split the text into two sections. You can find that file here. (1 MB EPub.) Both of these files pass EPubCheck without errors.

The Atlantis EPub renders (reasonably) well in all the local readers I have here, as well as the online Ibis Reader. It’s small (only 40K) and if you can do without a cover it’s a perfectly reasonable ebook. The Sigil copy does not do nearly as well. The online Ibis Reader refuses to render any of the images at all, including the cover image, the copyright glyph, and the generated images of the two grapevine glyphs that I inserted into the title page as decorations just to see what would happen. The copyright glyph issue is disturbing for legal reasons, but worse, it’s a standard character with a standard HTML encoding, and should be renderable irrespective of font. Ditto Azardi, which renders the Atlantis EPub well but not the Sigil copy. Over and above Azardi’s leaving out all the images (including the copyright glyph) the Sigil copy of the EPub loses what little formatting it had in the Atlantis EPub. None of the centered text remains centered, for example.

There are some additional weirdnesses in the readers themselves: FBReader renders both files well, but (weirdly) the Go Forward button moves the reading window toward the beginning of the file, and the Go Back button moves the window toward the end of the file, perfectly bass-ackwards. Ibis displays the title three times, which is overkill. FBReader handles the images just fine, but renders the copyright notice for both versions in Greek letters, sheesh.

These rendering issues are probably reader failures, since the files themselves are EPub-compliant. However, the autogenerated XML/XHTML code is often obscure, and in one case, at least, dead wrong: The title tag includes only the first line of the title. I understand that the title text is split into two lines, but I was never asked to define the text within the title tag and can only assume that Atlantis picked the first Heading 1 style it found and plugged its text into title. (The metadata for the title was stored correctly, and all readers displayed the full title text. I don’t think that the title tag is used by the readers. An empty title tag is perfectly acceptable to EPubCheck.) The gnarliest part of the compiled EPub (in both versions) is the CSS. Atlantis took the page format settings and translated them into generically named CSS classes, which are accurate representations of the word processor settings, but not easily identifiable and in no wise good quality CSS.

This isn’t insurmountable, and most of the problems I’ve had so far can be blamed on incomplete and buggy reader apps, but it shows how young a business this is. The hand coders still have the edge, and I’d be better off on the readability side creating the ebook text in a WYSIWYG HTML editor like Kompozer or Dreamweaver and hand-coding the CSS myself. That is, however, precisely what I’m trying to avoid. Sooner or later, Atlantis or something like it will offer pre-written CSS style sheets designed specifically for text intended for EPub export. That will help a great deal. In the meantime, some manual futzing is unavoidable, and my opinion of Sigil has been greatly tarnished. I may have to try something else on the EPub editor side; suggestions always welcome.

And the readers, yeech. Don’t get me started. I may have to buy an iPad just to see what my own damned books look like!

Odd Lots

  • The rate of toxoplasmosis infection in a given nation appears correlated to the level of neuroticism in that nation. I’ve mentioned toxo before, but it appears that we have better numbers now, and that the UK is nowhere near 50% infected, as the source I quoted in 2003 implied. France, well, now…
  • Here’s a nice piece that explains why the atmospheric CO2 measurements taken atop Mauna Loa are accurate. And here is NOAA’s explanation of how they do it.
  • Bruce Baker sent a link to an article describing how a few scraps of odd film allowed some dogged engineers to re-create the long-lost pallophotophone technology (later known as RCA Photophone), and by doing so give voice to 1920’s recordings of Thomas Edison speaking affectionately about his friend Henry Ford.
  • This weekend is ARRL Field Day, in which ham radio ops head out to the hinterlands to see how well they can get up to radio speed from a dead stop, using portable (not mains) power, from a place not previously set up for radio gear. I’ll be trekking out to my back deck and working the world on an inverted vee, draining 829Bs (Diet Mountain Dew over ice in peanut butter jars) Saturday afternoon, and showing all my geek friends how it works that evening during one of our semiregular geek parties.
  • We can look forward to the Roman Missal on iBooks soon. But will the priest process down the center aisle holding an iPad over his head?
  • I’ve been testing Windows-based EPub-capable reader apps for the last week or so, and guess what: They all suck. Bigtime. Why is it so hard to render reflowable documents that are basically HTML-in-a-sack?
  • If you’ve seen The Music Man as often as I have, you’ll remember how there’s trouble in River City, because the kids are memorizing jokes out of Captain Billy’s Whizbang. Well, I’d long thought that Meredith Willson had made it up, but not so: I found a scan of a 1921 issue of Captain Billy’s Whizbang (which is now in the public domain) and put it up in my pub directory so you can see it too. (Note: It’s a 15 MB .cbr.) My reaction? It’s not very funny, but in a world without Lileks, I guess people laughed at whatever they had on hand.
  • Ok, there’s a little profanity in it (like that’s unusual in Slashdot comments?) but damn, I like this one.

Atlantis and the EPub Toolchain

You’ve heard me say this before, and I suspect you’ll hear it again and again: Creating ebook files is much harder than it needs to be, and creating ebooks in the EPub format is particularly–and inexplicably–hard. In my June 9, 2010 entry, I spoke about the EPub format itself, and how it’s not a great deal different from a word processor file format. In fact, Eric Bowersox pointed out that OpenOffice’s ODF files are also based on XML and organized in a similar way.

Bogglingly, most people appear to be hand-coding EPub XML. In recent days I’ve been looking for better ways to create EPub ebooks. Many places online cite Sigil as the only WYSIWYG EPub editor in existence right now, and I grabbed it immediately. It’s a very nice item, but appears to be an undergraduate’s Google Code project, and I certainly hope he will hand it off to others if he ever gets tired of hammering on it. Version 0.2.1 has just been released, and it fixes a number of bugs that I stumbled over in the last couple of weeks that I’ve been using it.

Then, yesterday, without any need for ancient maps or Edgar Cayce, I found Atlantis.

The Atlantis word processor is a $35 shareware item created by a very small company in France. It’s portable software, meaning it can live on a thumb drive and does not have to be installed in the usual fashion. It’s tiny; nay, microscopic (the executable is 1.1 MB!!) and lightning fast. It doesn’t have all the fancy eye candy of modern software, but it’s amazingly capable, and highly focused on the core mission of getting documents down and formatted. It has a spellchecker and other interesting features like an “over-used words” detector. It reads and writes .doc, .docx, and .odt (ODF) files, and here’s the wild part: It exports to EPub.

Furthermore, it does a mighty good job of it. I loaded a .doc of my story “Whale Meat” into Atlantis and then exported it to EPub. The generated EPub file passed the very fussy EPubCheck validator immediately with flying colors. Now, this was pure text, without any images or embedded fonts or other fanciness, but that’s ok. You have to start somewhere, and I would prefer to start with a genuine word processor.

I then loaded the EPub file that Atlantis had generated into Sigil, which I used to divide the story into chapters and add a cover image. Sigil isn’t really a word processor in the same sense that Atlantis or Word are, but it allows split-screen editing of WYSIWYG text on one side and XML/XHTML code on the other. Sigil 0.2.0 had a bug that generated an incomplete and thus illegal IMG tag (XHTML requires the ALT attribute) but I see that the new 0.2.1 release fixes that. Adding the ALT attribute manually in Sigil 0.2.0 allowed the EPub file to pass EPubcheck without further errors.

I have not yet generated a TOC in Sigil, nor have I attempted to create an EPub of any significant size. (“Whale Meat” is only 8,700 words long.) When I’m through playing around, I’m going to load the entire .doc image of Cold Hands and Other Stories into Atlantis, export it to EPub, semanticize it in Sigil, and see what I have. At some point along the way I may be forced to hand-code (or at least hand-correct) the XML or XHTML, and you’ll hear me bellyache about it when I do. But I will admit that I’m pleased with what I have so far. Yes, Atlantis and Sigil ought to be one product, or at least two closely-knit utilities in the same product family. Still, given the primitive state of the EPub reader business (I have yet to find a Windows or Linux-based EPub reader that I’m willing to use) I’m satisfied with the way that Atlantis and Sigil cooperate. Now that Apple has anointed the EPub format for iBooks, I’m guessing that EPub-related improvements will be arriving thick and fast in coming months.