Zettelkasten

Baseline for Zettelkasten Software Reviews

I’m going to take a close look at applications to find out which are suitable to implement the Zettelkasten note archive. I already talked about reference managers. While reference managers can be switched pretty easily, migrating a database of notes is far from being a trivial task, depending on the software you used in the past. Therefore, we have to chose how to implement the note archive with great care. Here you’ll find my criteria.

In the comments, you can contribute to this list.

Criteria I Will Consider

First, we’ll see what the bare minimum features are. What does any note archive software has to achieve to be useful?

To find out, recall the essentials you have to adhere to when you want to create a Zettelkasten to be your second brain:

  1. Use hyper-textual notes: Notes should both link to one another directly to create a web and use keywords to form clusters with indirect relationships.
  2. Write a lot: Feed your Zettelkasten and make it grow. With time you will be surprised by the wealth and interconnectedness of information. There’s an estimate threshold which people assume to be at around 1000 notes. After that, your information retrieval will change.

There should be no obstacles when it comes to writing notes. The software has to make it effortless to add new notes and to connect them. To get knowledge out of the Zettelkasten, it has to be fast at retrieving notes, too. When these tasks were slow, you probably wouldn’t use the note archive enough.

The key elements I look at will be:

  • Note retrieval: for example, how fast and easy is full-text search?
  • Note creation: does it take many clicks or keystrokes to create a new Zettel note?
  • Note connections: which mechanisms does the app support to create connections? I’m leaving this point pretty vague intentionally. I know of various ways different applications deal with this problem. Also, I’m going to cheat a bit: if full-text search works, manually linking notes will work, too: just put the target’s identifier somewhere, copy it, search for the identifier, and open the resulting note.
  • Data export: nobody wants to be locked into a data silo. How does the application manage to get everything out without losing information? How does it manage to get stuff in?

Every software has its own features and limitations, of course. Wiki-like software makes linking painless, while others provide sophisticated search mechanisms. I will evaluate these individual properties too, of course, but only in respect to their usefulness for managing a Zettelkasten note archive.

So how will notes look like? How will I compare?

At first, let’s pretend that there’s no such thing as rich text. Some editors will support emboldening and italicizing of text like Microsoft Word does. Some will even support links to other notes you can click on. I consider these things interesting, but not mandatory.

Instead of searching for the perfect app for an ideal note, we should focus on the content and the data, the note text and the note’s attributes. After all, the Zettelkasten method works with paper, so why worry about the latest fad in computing? I think it’s more suitable to begin with the lowest hurdle possible, see which applications outreach the expectations, then allow to be surprised.

The Bare Minimum of a Zettel Note

Update 2020-10-03: I’ve updated the note format and replaced the screenshot with a more recent example.

Every note consists of various parts: it needs to have an identifier for linking, it needs to have body text or content to actually say something, and it may have some front matter and back matter to provide more info about the note or its sources.

To ensure that you and I talk about the same stuff when we use these terms, take a look at this example note of mine called 201910091801 Struggling when you acquire a skill could be important to become great in your field:

example note
Anatomy of a Zettel: a partitioned example note

In this example, the note is partitioned as follows:

  • The identifier is 201910091801, which stands for the 2019-10-09 at 18:01. It’s part of the file name and the note.
  • The body text consists of the first introductory sentence, and the list of details. You see there are multiple links to other notes that provide more details on the summarized list items: 201910091746 points to the note “Savants excel at their skill without innovating”.
  • The front matter or header (because it’s at the top of the note) is about metadata like the creation date. That is part of the date-based unique identifier. In my case, it contains a list of tags, too. Like I said, you could put the tags in the running text, too (“inter-textual tags”). This is just my personal preference and makes it easier to scan for relevant keywords, I find.

      # 201910091801 Struggling when you acquire a skill could be important to become great in your field
      #expertise #innovation
    
  • The back matter should contain references (and in case you write your notes in Markdown, it can also hold link definitions and footnotes). It may include a dedicated list of links on the same topic if you want to add those. In this example, there’s a literature reference in the MultiMarkdown format:

      [#epstein2019range]: David Epstein (2019):
      _Range. why generalists triumph in a specialized world_,
      New York: Riverhead Books.
    

As you see, I use plain-text notes. The syntax or “language” I use to format notes and add references is called MultiMarkdown. This isn’t mandatory; I only tell you so you know which particular details I settled on.

Now that we’ve taken a look at the example, we can strip away my personal twists on saving notes and focus on the essentials.

I want to establish an all-purpose note layout as a baseline in a way that every text editing software on this planet can cope with it.

The all-purpose, one-size-fits-all note

This one is suitable for every editor which supports saving text.

The body text or content dominates. Citations work as they do in print: you put “(Author YEAR)” in the appropriate places and append a readable list of references in the back matter. I put tags into the back matter as well so they get out of the way when reading the note; the topmost lines are precious screen real estate I don’t like to waste.

If you drop all the Markdown formatting, this is still a viable example:

201910091801 Struggling when you acquire a skill could be important
to become great in your field

Being good at something is not enough to be an innovator:

- Savants excel at their skill without innovating.[[201910091746]]
  So merely being good at something is not enough to innovate and
  be creative.
- Self-taught jazz musicians sound more interesting than their peers
  from jazz music school.[[201910091737]] "More interesting" can
  entail being more creative, too; Epstein writes about a lot of
  great musicians being self-taught.[chapter 3][#epstein2019range]

#expertise #innovation

---
References:

David Epstein (2019): Range. Why generalists triumph in a
specialized world, New York: Riverhead Books.

This is perfectly readable in any application that can display text.

Here’s two example pictures of similar notes – one in a plain text editor, one in a rich text editor:

View in a plain text editor vs rich-text editor with minimal formatting

Going plain text with MultiMarkdown

We’ve seen the usage of MultiMarkdown already when I showcased my own note. I use a text editor which is Markdown-aware and gives me some visual sugar to improve scannability. Visual aids are all rich text is about in the end. Just compare how the same note looks with and without Markdown-specific coloring:

comparison of syntax highlighting
Markdown syntax highlighting enabled and disabled

Plain text has its benefits, especially when it comes to longevity: while every other file format might go out of fashion at some future time, plain text will likely stay.

On the other hand, one could say that as long as Microsoft exists, there’ll be a Microsoft Word. If Microsoft as a company outlives us, then we’re going to be able to rely on .doc files for our entire lifetime. Vendor lock-in is only a problem when the company stops to support the application – but that seems unlikely in the case of Word:

So yes, [plain text] should last forever. But I think Word – like jpeg for photos – is going to be around for a very long time too. And it’s worth remembering that for writers these are intermediate formats – in the end you wind up as digital media, whether it’s Kindle, mobi, epub or whatever, all of them capable of being reverse engineered. The chances of being lost are, it seems to me, are extraordinarily remote.
(David Hewson)

When you use a plain text markup language like Markdown (or the MultiMarkdown flavor), you have a common standard to transform your text notes into something else, like PDFs for example.

I discovered MultiMarkdown because it is built into Scrivener. It supports metadata in the front matter or header which doesn’t show in HTML previews and exports to LaTeX nicely. Nowadays, there’s other means to get this feature, enabled by the popular Pandoc document converter.

Going plain text with Pandoc

While I am a plain text aficionado, MultiMarkdown is not a flavor for everyone. Pandoc is a popular conversion tool to create Word .doc files or PDFs from plain text input, even MultiMarkdown files. Transforming to nearly every possible format on earth useful for sharing documents with mere mortals.

If you are fond of Markdown but don’t like MultiMarkdown’s peculiarities, you can rely on Pandoc’s markup language support for conversion and utilize its all-purpose metadata format. The metadata is embedded in a different markup language called YAML. The front matter of my example note from above would look like this in YAML:

id:    201910091801
title: Struggling when you acquire a skill could be important to become great in your field
tags:  [#expertise #innovation]

Not much has changed. This is a self-contained YAML document. The nice thing about YAML is, though, that you can embed them into every text file if you put a line with three dashes (---) above and below the YAML content.

Here’s an example note with the full bibliography reference put in YAML front matter:

---
title:      Improve the structure of essays by rewriting
date:       2011-05-16 09:58:57
keywords:   [#writing, #essay, #practice]
references:
-   id: isaacson2004bf
    title: Benjamin Franklin. An American Life
    author:
    - family: Isaacson
      given: Walter
    year: 2004
    location: New York and London and Toronto and Sydney
    publisher: Simon and Schuster Paperbacks
    type: book
---

Benjamin Franklin improved his writing skill by re-writing other author's articles:[@isaacson2004bf p. 28]

*   Take notes when reading on single scraps of paper.
    (Like in the reading workflow for a #Zettelkasten)
*   Shuffle the notes sometimes
*   When the original content is forgotten, write an essay based on your notes
    and compare to the original
*   Sometimes try to write in verse instead

Cf. <http://www.earlyamerica.com/lives/franklin/chapt1/>

With Pandoc, you can create a PDF from this note with a real bibliography at the end because all the information is contained in the reference list of the YAML front matter. Alternatively, you can attach a BibTeX file with all the reference information. As I said, I like to preserve space at the top of the note, so this is the path I’d chose.

I think notes should be self-contained. If you don’t add the verbose reference information in the YAML front matter but use a BibTeX file instead, I recommend you put at least a basic textual representation of the reference in the note file just in case. This representation could look like the one I used in the one-size-fits-all example: Walter Isaacson (2004): Benjamin Franklin. An American Life, New York and London and Toronto and Sydney: Simon and Schuster Paperbacks.

Software I (Want to) Look at

gears
Photo Credit: nic_r via Compfight cc

This is a running list of applications I want to look at. I’m going to update this list over time and as your suggestions come in. Also, I’m going to link to reviews from here as soon as they’re done.

The current list of application candidates, in alphabetical order:

  • Bear (Mac)
  • Clibu (Web-based, platform independent)
  • ConnectedText (Windows)
  • DEVONthink (Mac) by Marko
  • Evernote (platform independent)
  • Emacs org mode + Deft (platform independent)
  • MacJournal
  • Notational Velocity and nvALT (Mac) by Christian
  • One Note (PC, Mac, even mobile) by DutchPete
  • PiggyDB (Java, platform independent)
  • P.S. Notes (Linux, Ubuntu)
  • ResophNotes (Windows)
  • SublimeText (platform independent)
  • SlipBox (Mac)
  • Tinderbox (Mac)
  • Trello (web) by Nick
  • WikidPad (platform independent)
  • VoodooPad (Mac)
  • vim with NVim (platform independent)
  • Zkn³ (platform independent)
  • and a paper-based Zettelkasten

I currently don’t have a Windows machine at my disposal. If you would like to help out with testing, just drop me a line.

Contribute: What’s Your Suggestion?

So here’s your call to action. Participate in the comments below:

  • Which criteria for testing would you like me to add?
  • Which software would you like to add to the list?