Towards a Lifelong Content Management System
What follows is a bit of a rant. If you’re passionate about CMSes, you’ll probably find something here that resonates with you. But I don’t have the solution yet. With that disclaimer…
I expect my content to grow with me for the rest of my life, and I want the same from my CMS. I’ve tried Plone, MediaWiki, WordPress, Serendipity, and others, and none of them have everything I’m looking for:
- expressive markup (LaTeX, Graphviz)
- multiple output formats (HTML, PDF)
- clean URLs
- comments (with threading)
- editing (“offline”, revision control)
There are now more than 1,000 published CMSes! [1] This number has doubled in the last 3 years. [2] This tells me that no one has gotten the formula right yet. And I think I know why.
What happened to the Unix philosophy?
The most frustrating part of looking for a CMS is finding one that fits most, but not all, of your requirements. MediaWiki is good at versioning, LaTeX, and Graphviz, but it doesn’t handle blog-style functionality like comments and tagging. WordPress is good at syndication, tagging, and comments, but it doesn’t handle complex articles.
Can we really find what we’re looking for among these monolithic systems that try to do everything? Even if you found a CMS that did everything you want right now, how do you know that it will still do everything you want next year? Technology is constantly changing, and so might your personal desires and needs. Do you want to tie yourself to the whims and schedule of a single team of programmers? Do you want to commit yourself to making changes to a complex system written in a single programming language, perhaps one that you’re not fond of?
The CMS as a complex multi-cellular organism
What constitutes a CMS? What are its smallest components?
I’d argue that CMSes focus more on managing data about content than the content itself. Content seems to be expected to look like a typical forum post: some headlines, a little bit of emphasis, and maybe a couple images.
There are a number of useful document formats out there for authoring content in:
- Markdown, Org mode, and other lightweight text formats can be converted to many output formats by tools like Pandoc.
- LaTeX excels at mathematics, diagrams, and typographic style.
Why are we reinventing them or presenting authors with a textarea for HTML input? What if one of these doesn’t completely satisfy your needs? Why do you need to stick to one format for everything you write? Isn’t the output format all that matters to your readers? If all of these tools can output what you need, why can’t you use them all?
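To make the "use them all" idea concrete, here is a minimal sketch of picking the input format per file while targeting a single output format. The `INPUT_FORMATS` table and the `pandoc_command` helper are names I made up for illustration; the `-f`, `-t`, and `-o` options and the `markdown`, `org`, and `latex` reader names are Pandoc's own. The sketch only builds the command line and assumes an output format whose writer name matches its file extension, such as `html`.

```python
from pathlib import Path

# Hypothetical mapping from source file extension to a Pandoc reader name.
INPUT_FORMATS = {
    ".md": "markdown",
    ".org": "org",
    ".tex": "latex",
}


def pandoc_command(source: str, output_format: str = "html") -> list[str]:
    """Build the pandoc argument list that converts one source file."""
    path = Path(source)
    input_format = INPUT_FORMATS.get(path.suffix)
    if input_format is None:
        raise ValueError(f"no input format registered for {path.suffix!r}")
    # The output file sits next to the source, with the extension swapped.
    target = path.with_suffix("." + output_format)
    return ["pandoc", "-f", input_format, "-t", output_format,
            "-o", str(target), str(path)]
```

With Pandoc installed, `subprocess.run(pandoc_command("article.org"), check=True)` would run the conversion. Adding another authoring format is then one more entry in the table rather than a change to the CMS.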
HTTP will be the protocol over which people retrieve content for the foreseeable future. But that doesn’t mean we should assume it always will be. Why not pull the HTTP interface out into a separate component as well? This allows us to begin to think about HTTP in its correct terms: as a distribution system. In this new paradigm, email is also a distribution system, as are printed documents.
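One way to picture distribution as a separate component is a small interface that every channel implements. This is only a sketch under my own assumptions: the `Distributor` protocol, the two example classes, and `publish` are hypothetical names, the directory channel stands in for "whatever a static HTTP server exposes", and the email channel only builds messages without sending them.

```python
from email.message import EmailMessage
from pathlib import Path
from typing import Protocol


class Distributor(Protocol):
    """Anything that can deliver one rendered document to readers."""

    def deliver(self, name: str, body: str) -> None: ...


class DirectoryDistributor:
    """Writes documents into a directory that a static HTTP server could expose."""

    def __init__(self, root: Path) -> None:
        self.root = root

    def deliver(self, name: str, body: str) -> None:
        self.root.mkdir(parents=True, exist_ok=True)
        (self.root / name).write_text(body, encoding="utf-8")


class EmailDistributor:
    """Wraps documents in email messages; actually sending them is left out."""

    def __init__(self, sender: str, recipient: str) -> None:
        self.sender = sender
        self.recipient = recipient
        self.outbox: list[EmailMessage] = []

    def deliver(self, name: str, body: str) -> None:
        message = EmailMessage()
        message["From"] = self.sender
        message["To"] = self.recipient
        message["Subject"] = name
        message.set_content(body, subtype="html")
        self.outbox.append(message)


def publish(name: str, body: str, distributors: list[Distributor]) -> None:
    """Hand the same rendered output to every configured channel."""
    for distributor in distributors:
        distributor.deliver(name, body)
```

The rendering step never needs to know which channels exist; a print queue or a feed generator would be one more class with a `deliver` method.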
Versioning and revision control
“Permalinks” aren’t that useful to other people citing your content when your content might change over time. A permalink is missing a vital component: a version number. But most CMSes don’t support versioning.
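As a rough illustration, a permalink that carries its version could look like the sketch below. The `/<slug>/rev/<revision>` layout, the `Permalink` type, and both helper functions are assumptions of mine, not an existing scheme; the revision is taken to be a hexadecimal identifier of 7 to 40 characters, such as an abbreviated or full commit hash.

```python
import re
from typing import NamedTuple


class Permalink(NamedTuple):
    slug: str
    revision: str


# Hypothetical URL layout: /<slug>/rev/<revision>
_PATTERN = re.compile(r"^/(?P<slug>[a-z0-9-]+)/rev/(?P<revision>[0-9a-f]{7,40})$")


def format_permalink(link: Permalink) -> str:
    """Render a link that pins both the article and the exact revision."""
    return f"/{link.slug}/rev/{link.revision}"


def parse_permalink(path: str) -> Permalink:
    """Split a versioned permalink back into its slug and revision."""
    match = _PATTERN.match(path)
    if match is None:
        raise ValueError(f"not a versioned permalink: {path!r}")
    return Permalink(match["slug"], match["revision"])
```

Someone citing the article would then link to one specific revision, while a plain `/<slug>` URL could keep pointing at the latest one.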
It’s also useful to be able to track diffs of your content, especially for people who want to receive updates or see what’s changed. Wikis tend to do a good job of this, but most CMSes have no support for it at all.
CMSes are generally designed around relational databases. Relational databases don’t generally have built-in versioning. This means that when a CMS implements revision control (which happens rarely), it tends to invent its own way of doing so.
What about using your favorite versioning system instead?
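If that versioning system happens to be Git (an assumption for this sketch; any other system would do), the history, any past revision, and the diff between two revisions of an article are each a single command away. The three helper functions are hypothetical names; the `git log`, `git show`, and `git diff` invocations they build are standard Git usage.

```python
def history_command(path: str) -> list[str]:
    """List the revisions that touched one file, following renames."""
    return ["git", "log", "--follow", "--date=short",
            "--format=%H %ad %s", "--", path]


def content_command(path: str, revision: str) -> list[str]:
    """Print a file as it existed at a revision.

    The path is interpreted relative to the repository root.
    """
    return ["git", "show", f"{revision}:{path}"]


def diff_command(path: str, old: str, new: str) -> list[str]:
    """Show what changed in one file between two revisions."""
    return ["git", "diff", old, new, "--", path]
```

Inside a repository, `subprocess.run(diff_command("cms.md", "v1", "v2"), capture_output=True, text=True, check=True).stdout` would give readers the "what's changed" view without the CMS implementing any versioning of its own. Here `cms.md`, `v1`, and `v2` are placeholder names.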
I notice a divide among online writers. On one side you have bloggers. Frequency, brevity, and relevance to current events seem to be their focus. On the other side you have academics and authors. They focus on writing a collection of longer, less ephemeral pieces. But rarely do we see a combination, and when we do, the two are handled by multiple systems: static HTML pages for the main site with papers and articles and a blogging system living under
Why this dichotomy? Why can’t your content management system have both? What happens when you want people to be able to comment on your articles? The easiest solution seems to be to cram your articles into your blog software. But this has negative side-effects.
Let’s build a system that treats all content types as first-class citizens. Allow people to rate or subscribe to updates of journal entries as well as articles.
Metadata and interaction
Content becomes more valuable when it’s interpreted by others. We’re all beginning to realize this, but we still don’t have general and interchangeable systems for doing so.
Most blog commenting systems are insufficient for anything but brief, short-lived posts. First of all, they aren’t generally threaded. This makes having a large meaningful conversation over a long period of time more difficult.
What about karma/voting systems for comments? Wouldn’t it be nice to have the most relevant comments about your article sorted to the top so that they appear immediately following your content?
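Threading and vote-based ordering fit together naturally. The sketch below is one possible shape, with assumptions of my own: the `Comment` fields, a `score` meaning net votes, and the `build_threads` helper are all hypothetical. It nests replies under their parents and sorts every level so that the highest-scored comments come first.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Comment:
    id: int
    parent_id: Optional[int]  # None marks a top-level comment
    score: int                # hypothetical net votes (up minus down)
    text: str
    replies: list["Comment"] = field(default_factory=list)


def _sort_by_score(comments: list[Comment]) -> None:
    """Sort one level by score, highest first, then recurse into replies."""
    comments.sort(key=lambda c: c.score, reverse=True)
    for comment in comments:
        _sort_by_score(comment.replies)


def build_threads(comments: list[Comment]) -> list[Comment]:
    """Turn a flat list of comments into score-sorted reply trees."""
    by_id = {c.id: c for c in comments}
    for comment in comments:
        comment.replies = []  # reset so repeated calls do not duplicate replies
    roots: list[Comment] = []
    for comment in comments:
        parent = by_id.get(comment.parent_id) if comment.parent_id is not None else None
        # A comment whose parent is unknown is treated as top-level.
        (parent.replies if parent is not None else roots).append(comment)
    _sort_by_score(roots)
    return roots
```

Because Python's sort is stable, comments with equal scores keep the order in which they arrived.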
This article is published with my lifelong CMS
The article you’re reading was created and published in the style I’ve described above. It’s still experimental and how it’s built and hosted has changed over the past 10 years, but the content hasn’t. You can see how it’s evolved on GitHub.
- Publishing formats: Pandoc’s Markdown
- Publishing system: Nix, GNU Make, Bash, jcoreutils, jsonwrench
- HTTP dispatcher/interaction
- Revision control
[1] http://www.cmsmatrix.org/ lists 1,049 CMSes as of 2009-04-25. This number does not include many wikis!
[2] Jono Bacon. Context vs. Content. 2006.