by yumiris on 3/2/22, 10:59 AM with 128 comments
by dang on 3/2/22, 6:20 PM
Write plain text files - https://news.ycombinator.com/item?id=30521545 - March 2022 (345 comments)
by nonrandomstring on 3/2/22, 1:08 PM
The author invokes the concept of "authenticity", and that's where it gets interesting.
I used to set my students a question about information content in a class on the philosophy of procedural representation.
We had a very high resolution photo of the aviation pioneer Amelia Earhart, and a short grainy video clip of her getting into a plane and smiling and waving.
My question was: Which one of these two media conveys more information about Amelia?
One gave extraordinary detail of her face, eyes, and seemed to many was a much better "fidelity" document. Others noticed that although you couldn't see her face in the video, you could feel from her gait, waving, body language and the way she shook hands _much more_ about her than from the static photo.
Both files are the same size in bytes.
So which one has more "information"? Which one is more "authentic"?
Not to attempt to answer here with a deep dive into phenomenology, but each carries a different kind of information, which can be static, dynamic, or meta-dynamic in higher orders relative to a matrix of assumptions that must be carried forward in parallel by the culture that wants to decode the message later.
I like that Miris tries to explore this by questioning the richness of text. But maybe the question doesn't hold up well under those conditions of investigation - because one might say that a great poet using only a few words might capture a landscape better than a painting, but if our culture drifts toward a visual one where poetry is no longer understood we cannot say that the medium itself degraded.
by thomascgalvin on 3/2/22, 2:43 PM
I don't think anyone really argues that everything should be plain text, even if that's an easy shorthand. The real argument is "use the simplest, most open format possible."
Nobody is suggesting you go through all of your photos, transcribe your emotional reaction to each picture, and then delete the image. But, if you want to view those same photos when you're fifty years old, or seventy-five, you're better off storing them as a JPEG than a PSD, and you're better off storing them on a hard drive you have access to in addition to whatever cloud they're currently occupying.
"Write plain text" is a shorthand for "use open formats." Because so much of what this audience does is test-based, plain text is the most common format we use, from source code to journaling, but that message applies to pretty much anything: if you lock yourself into a proprietary format, or a proprietary editor, you will almost certainly lose data over the long term.
by llarsson on 3/2/22, 12:23 PM
by eatmygodetia on 3/2/22, 2:27 PM
But maybe we should all use monochrome bitmap files for everything? That would be very simple.
by yumiris on 3/2/22, 11:06 AM
Will re-re-re-revise it again with fresh eyes after resting 'em!
by aasasd on 3/2/22, 12:39 PM
Thankfully I'm using Org-mode these days, which is reasonably ‘plain text’ under practical definitions—but I make dozens new headings every week, and each of them is stamped with the creation time. But boy do I miss having modification times too—should probably finally set up automatic commits to Git. Also need to mess with Orgzly so that it marks notes that are created on the phone.
by brians on 3/2/22, 2:46 PM
Oh, sweet summer child. Scribe/mss. Koalapad. A bunch of Apple 2GS, Apple 3, and Lisa formats. Lotus Improv.
The points about semantics and authenticity are wonderful, but I think the presumption that all formats can be opened is mistaken exactly because those that can’t be opened become effectively invisible and lost.
by ggm on 3/2/22, 11:51 AM
by briandoll on 3/2/22, 5:02 PM
I've been using computers daily for about 35 years now and I have a _lot_ of plain text files that I regularly use -- notes, lists, outlines, quotes, links, etc. Does anyone who has been around a while, have a large multi-decade collection of texts that are _not_ plain text? What formats do you use? How do you maintain access to those files over time?
by titzer on 3/2/22, 4:09 PM
I would contend that capturing a picture is absolutely a massive distortion of reality because reality is three dimensional, exists in many spectra beyond visible light, has sounds, smells, taste, and feeling, and exists in a historical context. The selection of framing, distance, focus, all of these are biases of the photographer. A photo is a lie, too. Just because it's higher resolution doesn't mean it has indeed captured the right information.
Text is a lie too, granted. But in our current digitization zeitgeist, we have forgotten that our media (pictures, video, recordings, not just the TV, cable, and internet) lie to us. Our own bias towards slicing apart the world into computer-digestible bits is just us lying more convincingly to ourselves.
by orzig on 3/2/22, 1:10 PM
by copperx on 3/2/22, 11:49 AM
Some examples are sorely needed. How is a Word/InDesign file more authentic than a plain text file? Or is the author talking about media? Is a ProTools session more authentic than Wav files?
by jauco on 3/2/22, 2:57 PM
Real archivists use a lot of data :)
by davbryn1 on 3/2/22, 5:02 PM
Or, you need to become a better writer.
by Annatar on 3/2/22, 2:03 PM
by nicbou on 3/2/22, 8:00 PM
My timeline thing [0] keeps the original archives, stores the timeline entries in a database, and exports them hourly as JSON + files. If the code stops working or the database crashes, the files are still there. The automated backups are there too. No information is lost.
However, the richness is not lost in the process. This timeline has geolocation history, notebook scans and a bunch of other things that don't really translate to plain text.
The most important difference is that I can write to my timeline from my phone. Managing text files across devices is quite troublesome by comparison. If I want plain text out of it, I can write a new Destination that pipes entries to plain text files or to a fax machine.
by dorfsmay on 3/2/22, 2:43 PM
by writegit on 3/2/22, 5:13 PM
I have a daemon that watches for binary changes in writing documents.
If changes are identified then it runs:
$ libreoffice --headless --convert-to txt <CHANGED_FILES>
Then commits the plaintext to a git repo.Allows for diffs, text search, and "longevity" across "authentic" docs.
by VariableStar on 3/2/22, 12:50 PM
by highspeedbus on 3/2/22, 2:53 PM
Despite text being fully portable, it is limited when it's needed to link a image or other files. People often forget how useful this concept is.
Html is not a viable option as it is awfully verbose for taking simple a note.
Markdown adds just enough semantics that is perfectly readable. From a hex editor to Microsoft Word.
We're in a somewhat critical moment, where markdown can either stay as it is, then dominate and become a godsend format of solid usability for decades, or a harmful feature is added that would slowly drag the whole thing down until the next Just Write Plain Text blog post.
by ad404b8a372f2b9 on 3/2/22, 12:34 PM
by Beldin on 3/2/22, 11:51 PM
That would give you great "authenticity" (in his definition) and great longevity.
Not practical for reading back, but that was not the point. With the help of a few simple scripts, writing is easy. So, in the end, not really an argument against storing information exclusively in plaintext.
by jjice on 3/2/22, 3:58 PM
by thematrixadmin on 3/2/22, 1:09 PM
by happyglands on 3/2/22, 4:38 PM
by m348e912 on 3/2/22, 10:21 PM
by amiga1200 on 3/2/22, 1:02 PM
by jdvh on 3/2/22, 1:07 PM
Text+ is compelling because you can have images and some kind of formatting. You want to store metadata and have backlinks and tags. Ideally with the possibility of collaborative editing.
There should be a way to fuse these two.
by quasarj on 3/2/22, 5:04 PM
by chaxor on 3/2/22, 6:42 PM
This would be great for many reasons. At the top of that list for example, is getting a lot more use out of those hard drives you paid for.
by dade_ on 3/2/22, 1:02 PM
by anotherevan on 3/3/22, 9:45 PM
Paraphrased: Make your information capture format as simple as possible, but no more so.
by gandalfff on 3/2/22, 3:07 PM
by a1445c8b on 3/2/22, 2:39 PM