Sandra And Woo do the Voynich… The Book of Woo!

Online webcomic Sandra And Woo has just taken a detour into CryptoLand, with a Voynich-inspired page called The Book of Woo to celebrate its 500th edition. What’s more, author Oliver Knöerzer (AKA “Kernel River Zoo”) has offered a $250 reward to “the person who is able to provide a decipherment that’s sufficiently close to the plain text“, plus “another $100 to two charities determined by the readers who contributed the most useful information for breaking the code.” Really, Oliver, I’d have helped regardless. icon wink Sandra And Woo do the Voynich... The Book of Woo!

The Book of Woo’s most obvious predecessor would seem to be the Codex Seraphinianus, which is also “primarily a work of art, not a puzzle for the general public“, though I wouldn’t describe the Book of Woo as being quite as hardcore as that (but then again, what is?). The Vick Industries cipher seems to be a more design-oriented mindset entirely, though the art-house rationale behind that has yet to emerge into the light.

How is anyone supposed to decrypt The Book Of Woo? Helpfully, Knöerzer does throw a handful of hints in our path, though mainly about what it isn’t rather than about what it is. He says:-

* The encryption isn’t based on an algorithm only suitable for computers which executes a loop 100 times or something like that.
* The encryption isn’t based on some sort of device or mechanism that is hard to get.
* No “classical” steganographic method was used since that would just be impossibly hard to crack.
* The plain text is some sort of literature, as one can guess from Woo’s comment and the illustrations. A lot of time went into the plain text as well, it’s not just a copy of the first page of Rascal or something like that.

But he also warns that “[if] you think you can simply carry out a frequency analysis on the letters and be able to reconstruct the English or German plain text this way, well, that’s just a waste of time.” Indeed, even a brief look at the text reveals blocks of characters arranged in a very artificial CVCV (consonant-vowel-consonant-vowel) manner. There are also quite a few patterns that are repeated multiple times: here’s a colourized section of the first page, so you can see a bit of what I’m talking about…

book of woo page 1 colourized cropped Sandra And Woo do the Voynich... The Book of Woo!

What’s going on here? Well… I’ve had a few brief email exchanges with Oliver recently, so have possibly at least a flicker of an idea. And given that he has already openly flagged both the Voynich Manuscript and my book on it (The Curse of the Voynich) as having been useful (he’s even reused the Voynich’s “T” gallows character in his cipher alphabet), it probably wouldn’t hurt to recap a few Voynich-related observations here. icon smile Sandra And Woo do the Voynich... The Book of Woo!

The first thing to say about ‘Voynichese’ (the structure that shapes the Voynich Manuscript’s text) is that there seem to be two main schools of thought: (a) that it’s a cipher system that for some reason our statistical toolkits aren’t able to help us much with, and (b) that it’s a real language but we’re too in love with our analyses to see the bleedin’ obvious.

(For the record, I’m in the (a) camp, which means that when I look at a map of all the different types of Voynichese evidence, I want to understand what kind of trick was used to confound all the different statistical tests, rather than throw my hands up in the air and say “Stats, shmats!”.)

The second thing to note is that almost all of the Voynich Manuscript is written using a very compact alphabet (roughly 22 characters), whereas The Book of Woo uses something like fifty unique shapes (I haven’t transcribed it yet, but that’s how it looks). What connects them is that they are both very predictable at the character level… up to a point. That is, in some circumstances you can reliably predict what the next character along is going to be, but in other circumstances predictions can be of little use.

(For what it’s worth, I believe that it is this specific combination of predictability and unpredictability that convinces people that Voynichese is a language, whereas real languages only tend to work like that in a few very specific ways, e.g. “q” almost always being followed by “u”.)

Trying to account for this property ultimately led me to conclude that the Voynich Manuscript in part uses “verbose cipher”, i.e. employing pairs or groups of letters to encipher single letters in a misleading way. For example, the Voynichese letter-pair “or” gets repeated immediately after itself a number of times, with the best known examples being on page f15v:-

or or oro r Sandra And Woo do the Voynich... The Book of Woo!

Do any real-life languages do this? I don’t think so, but that remains a matter of opinion.

The Voynich Manuscript has a large number of extra curious properties that I believe point to other tricksy mechanisms (e.g. in-page transposition of some sort, if you please), but my suspicion right now (based only on having a nose around it) is that Oliver may have found these unnecessarily abstruse to build a cipher around.

No: I think what’s going on in The Book Of Woo will turn out to be largely based around verbose cipher – specifically a combination of paired letters. Having said that, the big problem with a simple verbose cipher is that it is, well, as verbose as it sounds: and so to make it not bloat as badly as a Microsoft application, it needs some compression tricks to be used at the same time.

In the case of the Voynich Manuscript, I suspect that verbose cipher gets combined with the kind of scribal abbreviation in use during the 15th century. Similarly, because the overall word-length isn’t too extreme for The Book of Woo, I suspect (a) that certain letters used at the start or end of words will encipher prefixes or suffixes, somewhat like a kind of shorthand; and (b) that it’s more likely to be English than German. icon wink Sandra And Woo do the Voynich... The Book of Woo! It may well also be that certain letter pairs themselves encipher common letter pairs or even letter triples (such as “the”): these are the kinds of tricks I’d expect to see here being used to disguise the structure.

And yet… words seem to be words (i.e. it’s an aristocrat cryptogram rather than a patristocrat cryptogram), so it’s very much as if he wants to help us, not hinder us. So even though it looks a bit tricky at first glance, maybe it will all fall out nicely in the end. We shall see, hopefully before issue #1000! icon wink Sandra And Woo do the Voynich... The Book of Woo!

15 Comments

  1. avatar Novil July 30, 2013 4:57 am

    Thank you for the detailed article, Nick. I want to add that several readers have already posted interesting facts, such as a word frequency analysis, about the text in the comment section that might be helpful to decode it.

    http://www.sandraandwoo.com/

  2. avatar Diane July 30, 2013 5:40 am

    Speaking of *that* manuscript, on gloomy days I cheer myself up by remembering that by Voynich standards, you, I and everyone else are authorities on the Mahābhārata.

    No, I can’t read a word of it, but I can say most confidently that its language is Sanskrit, and its pictures look epic-like.

    :)

  3. avatar Diane July 30, 2013 6:03 am

    Pictures second page
    [Greek] Key ring / carabiners aka ‘biners’,

    dead man’s hand / hand of glory,
    * hand of glory should be the left hand

    … racoon (on a hot tin roof?).

  4. avatar Ruby Novačna July 30, 2013 10:36 am

    Hello Nick!
    In the Voynich text you are quoting, two letters are clearly distinguished: one has the straight leg and the other has the curved leg.
    If we read the straight letter as “r”, the letter to the curved leg will “?”.
    If we propose “n”, for example, it will be “Poron Orsha …” – “To fall nuts (seeds)” and the second “Xeionon onoram …” – “to honor of Xeionon …”. or well Zeionon ?
    Ruby

    http://readingvoynich.wordpress.com

  5. avatar Diane July 31, 2013 11:41 am

    I received a ‘spam’ email which appeared to come from another Voynichero. My own was then affected, but is now clear.

    The virus seems only to infect emails:

    http://securitywatch.pcmag.com/google/283638-gmail-filter-virus-makes-you-a-spammer

  6. avatar thomas spande August 2, 2013 8:01 pm

    Dear all, Looks to me like someone has gotten heavily into Tironian notation! I lack the motivation even for big prize money to go there! Cheers, Tom

  7. avatar bdid1dr August 3, 2013 9:18 pm

    I see only uncertainty in forming the two look-alike Vms characters for the alphabet letters “R” and “S”. I see uncertaintly because one character is ALMOST a mirror image of the next character of the “alphabet”: “R” looks almost like a mirror-image of “S”. I’ve mentioned several times that the sibilant “s” is represented by a Cyrillic capital “c” which “looks like” an Ottoman/Turk “sickle”-shape (or a waning moon with a handle extension).

    The “R” is represented by what “looks-like” a backward-facing “S”. So, if someone wanted to write the “Voynich word for “Rose”, all one would have to do is write three characters:

    S (backwards) o (sickle) — that’s it. r o s

    So, can you now see just how confused a “newbie” scribe might become when taking dictation, say, in Turkish while trying to translate the Turkish into Latin, for the benefit of his Flemish client?

  8. avatar bdid1dr August 4, 2013 8:26 pm

    A tentative translation (latin) for the sample you’ve partially shown: could be read as sep-a-rar-ios-ce-am. “to separate or distinguish”

  9. avatar bdid1dr August 5, 2013 4:00 pm

    I’ll xpand my discussion (once more):

    Those elaborate “P” symbols, which most often appear at the beginning of many “Voynich” folios, are in fact “P”.

    What is most interesting, to me, is that depending on the subject matter being “Pres”ented, whether “Pot”anical,”Esp”ecies, or “Ph”armaceutical, or “Pab”lumox, this “P” character is “Pro-BaB-ly” the most Powerful syllable in the entire manuscript. Because whole syllables can be added to the “front” of the “P” (Especies) (Explanation) as well as the SamPles I have just PResented, I hope I can PRove my case with this most current discussion which I have demonstrated on at least four of Nick’s manifold pages.

    Now there is another matter for discussion: the difference in the “M” and “N” ciphers……..

  10. avatar bdid1dr August 7, 2013 12:27 am

    Nick, if you were referring to the “T”-gallows as being the Vms character which has a loop on each leg, I’m hoping you haven’t forgotten my findings as far as the Vms character which has a loop above each leg represents the sound of el or ell. The Vms character for the sound of “tl” is almost the same as for “el” BUT it has a loop only the right leg. What has puzzled me all along is that I find no Vms character for the sound of LT.
    So, I 1dr if Sandra and Woo may have been 1drng as much as I have. ???
    bdid1dr

  11. avatar Diane August 10, 2013 3:20 pm

    Nick,
    I went to look up again a reference I gave members to the Voynich mailing list some months ago, but the gameszoo ‘voynich monkeys’ site seems not to be visible online.

    Do you know if the problem is temporary, or has the mailing list gone private?

  12. avatar Porter August 11, 2013 3:05 am

    Speaking of Voynich, could just a portion of the Voynich Manuscript be in a language which, for whatever reason, we have no access to? Maybe there some bits of an extinct language entwined with all the other words written in languages that we know about. That’s just what I would think given what you said about our cryptographic methods being super bad at dealing with, combined with the reasons you stated for it not being an entirely new language.

    Honest curiosity, I’m not a cryptanalyst, I don’t even know how that would work. Or even whether it would.

  13. avatar bdid1dr August 13, 2013 11:10 pm

    Nick, Diane, Porter, ThomS et al:
    I’ve just finished a couple of hours on a website called “Metamedia”. My very rapid-reading of some 120-pages of the document answered several questions and confirmed my translations of several folios, and especially a whole lot of Ottoman and Byzantine history (Bazeyid, Suleiman & his advisors, and Busbecq’s travels). I can’t bring up the url for this extract from the website “Metapedia”.
    So, if this interests you at all, plug in the terms Ogier Ghislain de Busbecq – Metapedia

    I’ve pretty much ruled out Clusius as being the writer of Boenicke manuscript 408. Dodoens doesn’t fit either.

    More squinty than ever! :-)

  14. avatar Diane September 3, 2013 11:32 am

    NOT an ad.

    I see that the Voynich ms is now available as e-book

    http://shkspr.mobi/blog/2013/08/voynich-manuscript-ebook/

Leave a Reply

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>