HN comments for: Writing a Unix clone in about a month

kpw94

32 replies

1d1h

2024-05-24 17:10:18 UTC

I also finally learned how signals work from top to bottom, and boy is it ugly. I’ve always felt that this was one of the weakest points in the design of Unix and this project did nothing to disabuse me of that notion.

Would love any resources that go into more detail, if any HN-er or the author himself knows of some!

chubot

9 replies

2024-05-24 18:19:38 UTC

If you haven't already, I would start with Advanced Programming in the Unix Environment by Stevens

https://www.amazon.com/Advanced-Programming-UNIX-Environment...

It is about using all Unix APIs from user space, including signals and processes.

(I am not sure what to recommend if you want to implement signals in the kernel, maybe https://pdos.csail.mit.edu/6.828/2012/xv6.html )

---

It's honestly a breath of fresh air to simply read a book that explains clearly how Unix works, with self-contained examples, and which is comprehensive and organized. (If you don't know C, that can be a barrier, but that's also a barrier to reading blog posts)

I don't believe the equivalent information is anywhere on the web. (I have a lot of Unix trivia on my blog, which people still read, but it's not the same)

IMO there are some things for which it's really inefficient to use blog posts or Google or LLMs, and if you want to understand Unix signals that's probably one of them.

(This book isn't "cheap" even used, but IMO it survives with a high price precisely because the information is valuable. You get what you pay for, etc. And for a working programmer it is cheap, relatively speaking.)

balder1991

6 replies

22h33m

2024-05-24 19:53:28 UTC

I believe this was the 3rd time I’ve seen this book being recommended this week. It must mean something.

pjmlp

2 replies

22h20m

2024-05-24 20:06:18 UTC

It is a must for anyone serious about UNIX programming.

Additionally one should get the TCP/IP and UNIX streams books from the same collection.

philosopher1234

1 replies

15h6m

2024-05-25 03:20:17 UTC

Is the Unix streams book “UNIX System V Network Programming”?

pjmlp

0 replies

9h53m

2024-05-25 08:33:36 UTC

That one is also relevant, yeah.

Although I made a mistake: I was thinking about all of Richard Stevens' networking books, which go beyond plain TCP, UDP, IP.

https://en.wikipedia.org/wiki/W._Richard_Stevens

Unfortunately, given their CS focus, they are kind of on the expensive side. I read most of them via libraries, or eventually got my own copies.

madhadron

0 replies

22h26m

2024-05-24 20:01:13 UTC

It's been the standard reference for decades for a reason. I learned from it, too. There's really nothing else quite like it available.

lanstin

0 replies

17h55m

2024-05-25 00:31:23 UTC

It's well written and full of practical advice and fun to read.

Terr_

0 replies

22h27m

2024-05-24 19:59:47 UTC

It might mean the Baader–Meinhof effect.

aspectmin

1 replies

19h22m

2024-05-24 23:04:19 UTC

Not positive, but pretty sure that this, and the Unix Network book were golden for us in the 90s when we were writing MUDs. Explained so much about Socket communications (bind/listen/accept,...) Been a long time since I looked at that stuff, but those were fun times.

HankB99

0 replies

16h25m

2024-05-25 02:02:14 UTC

I believe that's the book I still have on my shelf. IIRC "UNIX Network Programming" and I learned a lot about networking and a lot about how UNIX works reading it cover to cover. I think I learned more from that book than any other.

Mr Stevens replied to something I wrote back in the day. I can't recall if it was a Usenet post or email, but I was over the moon!

retrac

6 replies

2024-05-24 18:00:33 UTC

Signals are at the intersection of asynchronous IO/syscalls, and interprocess communication. Async and IPC are also weak points in the original Unix design, not originally present. Signals are an awkward attempt to patch some async IPC into the design. They're prone to race conditions. What happens when you get a signal when handling a signal? And what to do with a signal when the process is in the middle of a system call is also a bit unclear. Delay? Queue? Pull process out of the syscall?

If all syscalls are async (a design principle of many modern OSes) then that aspect is solved. And if there is a reliable channel-like system for IPC (also a design principle of many modern OSes) then you can implement not only signals but also more sophisticated async inter-process communication/procedure calls.
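
To make the "middle of a system call" question concrete, here is a minimal sketch of the "pull the process out of the syscall" answer, assuming a POSIX system. The helper name `read_interrupted_by_alarm` and the 10 ms timer are made up for illustration. With a handler installed without `SA_RESTART`, a blocked `read(2)` is expected to fail with `EINTR` when the signal arrives:

```c
#define _XOPEN_SOURCE 700
#include <errno.h>
#include <signal.h>
#include <string.h>
#include <sys/time.h>
#include <unistd.h>

/* Empty handler: merely having one installed is enough to interrupt read(). */
static void on_alarm(int signo) { (void)signo; }

/*
 * Hypothetical helper: blocks in read(2) on an empty pipe while a repeating
 * 10 ms timer delivers SIGALRM. Returns the errno left by read() (EINTR is
 * expected), 0 if read() somehow succeeded, or -1 if setup failed.
 */
int read_interrupted_by_alarm(void)
{
    int fds[2];
    if (pipe(fds) == -1)
        return -1;

    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_alarm;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = 0;                 /* deliberately no SA_RESTART */
    if (sigaction(SIGALRM, &sa, NULL) == -1)
        return -1;

    /* Repeat every 10 ms so a tick that lands before read() starts is harmless. */
    struct itimerval tick;
    memset(&tick, 0, sizeof tick);
    tick.it_value.tv_usec = 10000;
    tick.it_interval.tv_usec = 10000;
    if (setitimer(ITIMER_REAL, &tick, NULL) == -1)
        return -1;

    char byte;
    ssize_t n = read(fds[0], &byte, 1);   /* nothing is ever written to the pipe */
    int err = (n == -1) ? errno : 0;

    memset(&tick, 0, sizeof tick);        /* disarm the timer */
    setitimer(ITIMER_REAL, &tick, NULL);
    close(fds[0]);
    close(fds[1]);
    return err;
}
```

Setting `sa.sa_flags = SA_RESTART` instead asks the kernel to restart the call rather than return `EINTR`, which is the other common answer; in this sketch that would leave the read blocked, since nothing ever writes to the pipe.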

Joker_vD

3 replies

23h19m

2024-05-24 19:08:04 UTC

As I wrote in some older discussion about UNIX signals on HN, the root problem (IMHO, of course) is that signals conflate three different useful concepts. The first is asynchronous external events (SIGHUP, SIGINT) that the process should be notified about in a timely manner and given an opportunity to react; the second is synchronous internal events (SIGILL, SIGSEGV) caused by the process itself, so it's basically low-level exceptions; and the third is process/scheduling management (SIGKILL, SIGSTOP, SIGCONT) to which the process has no chance to react so it's basically a way to save up on syscalls/ioctls on pidfds. An interesting special case is SIGALRM which is an asynchronous internal event.

See the original comment [0] for slightly more spelled-out ideas on better designs for those three-and-a-half concepts.

[0] https://news.ycombinator.com/item?id=39595904
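
The split between the categories is visible from user space. As a small sketch, assuming POSIX `sigaction` (the helper `try_install_handler` is a hypothetical name), installing a handler works for the "event" and "exception" signals but is refused for SIGKILL and SIGSTOP:

```c
#define _XOPEN_SOURCE 700
#include <errno.h>
#include <signal.h>
#include <string.h>

static void noop_handler(int signo) { (void)signo; }

/*
 * Hypothetical helper: tries to install a handler for signo.
 * Returns 0 on success, otherwise the errno set by sigaction().
 */
int try_install_handler(int signo)
{
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = noop_handler;
    sigemptyset(&sa.sa_mask);

    errno = 0;
    return sigaction(signo, &sa, NULL) == 0 ? 0 : errno;
}
```

Calling it with SIGINT or SIGSEGV should return 0, while SIGKILL and SIGSTOP should come back as `EINVAL`: the process never gets a say in those two.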

mananaysiempre

2 replies

17h49m

2024-05-25 00:37:44 UTC

At least the first two are also conflated in a typical CPU’s trap/interrupt/whatever-your-architecture-calls-it model, which is what Unix signals are essentially a copy of. So this isn’t necessarily illogical.

KerrAvon

1 replies

15h25m

2024-05-25 03:01:21 UTC

SIGHUP and SIGINT have no CPU-level equivalent.

mananaysiempre

0 replies

8h19m

2024-05-25 10:07:43 UTC

Sure. What I meant is, a CPU’s trap/interrupt mechanism is very often used to signal both problems that arise synchronously due to execution of the application code (such as an illegal instruction or a bus error) and hardware events that happen asynchronously (such as a timer firing, a receiver passing a high-water mark in a buffer, or a UART detecting a break condition). This is not that far away from SIGSEGV vs SIGHUP.

Some things (“imprecise traps”) sometimes blur the difference between the two categories, but they usually admit little in the way of useful handling. (“Some of the code that’s been executing somewhere around this point caused a bus error, now figure out what to do about it.”)

lanstin

0 replies

17h49m

2024-05-25 00:38:04 UTC

A story about the problem with delivering interrupts to a process in kernel mode in unix:

https://www.dreamsongs.com/RiseOfWorseIsBetter.html

chasil

0 replies

21h55m

2024-05-24 20:31:48 UTC

IPC was actually introduced in "Columbus UNIX."

https://en.wikipedia.org/wiki/CB_UNIX

pcwalton

3 replies

2024-05-24 18:08:13 UTC

"signalfd is useless" is a good article: https://ldpreload.com/blog/signalfd-is-useless

It goes into the problems with Unix signals, and then explains why Linux's attempt to solve them, signalfd, doesn't work well.
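
For readers who have not used it, here is a minimal Linux-only sketch of signalfd in action; it is not taken from the article, and the helper `count_signalfd_reads` is a made-up name. It also shows one limitation signalfd inherits from ordinary signals: a standard signal raised several times while blocked is delivered only once.

```c
#define _GNU_SOURCE
#include <signal.h>
#include <sys/signalfd.h>
#include <unistd.h>

/*
 * Hypothetical helper: blocks SIGUSR1, raises it `times` times, then counts
 * how many records can be read back from a non-blocking signalfd.
 * Returns that count, or -1 on error.
 */
int count_signalfd_reads(int times)
{
    sigset_t mask, old;
    sigemptyset(&mask);
    sigaddset(&mask, SIGUSR1);

    /* The signal must be blocked, or the default action would kill us. */
    if (sigprocmask(SIG_BLOCK, &mask, &old) == -1)
        return -1;

    int fd = signalfd(-1, &mask, SFD_NONBLOCK | SFD_CLOEXEC);
    if (fd == -1) {
        sigprocmask(SIG_SETMASK, &old, NULL);
        return -1;
    }

    for (int i = 0; i < times; i++)
        raise(SIGUSR1);

    int seen = 0;
    struct signalfd_siginfo info;
    /* Each successful read consumes one pending signal; stop on EAGAIN. */
    while (read(fd, &info, sizeof info) == (ssize_t)sizeof info) {
        if ((int)info.ssi_signo == SIGUSR1)
            seen++;
    }

    close(fd);
    sigprocmask(SIG_SETMASK, &old, NULL);   /* nothing is pending any more */
    return seen;
}
```

On Linux, `count_signalfd_reads(3)` is expected to return 1, not 3: standard signals are not queued, so the file descriptor only ever sees one pending SIGUSR1.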

lelanthran

2 replies

12h18m

2024-05-25 06:08:44 UTC

That is a good article. I found myself nodding in agreement while reading it, thinking "Yeah, I've been bitten by that before".

How does Windows handle this? There's still signals, but I believe/was under the impression that signals in Windows are an add-on to make the POSIX subsystem work, so maybe it isn't as broken (for example, I think it doesn't coalesce signals).

okanat

1 replies

5h51m

2024-05-25 12:36:03 UTC

Windows has a slightly better concept: Structured Exceptions (https://learn.microsoft.com/en-us/windows/win32/debug/struct...). It is a universal concept to handle all sorts of unexpected situations like divide by zero, illegal instructions, bad memory accesses... For console actions like Ctrl+C it has a separate API which automatically creates a thread for the process to call the handler: https://learn.microsoft.com/en-us/windows/console/handlerrou... . And of course Windows GUI apps receive the Window close events as Win32 messages.

Normal Windows apps don't have a full POSIX subsystem running under them. The libc signal() call is a wrapper around structured exceptions. It is limited to only a couple of well-known signals. MSVCRT does a bunch of stuff to provide an emulation for Unix-style C programs: https://learn.microsoft.com/en-us/cpp/c-runtime-library/refe...

In contrast to Unix signals, structured exceptions can give you quite a bit more information about what exactly happened like the process state, register context etc. You can set the handler to be called before or after the OS stack unwinding happens.

lelanthran

0 replies

5h1m

2024-05-25 13:25:21 UTC

I am such a moron. Every one of those three links above is colored as 'visited' for me.

I have obviously read this up before and just didn't remember :-(

jkrejcha

3 replies

17h17m

2024-05-25 01:10:14 UTC

Unix signals do... a lot of things that are separate concepts imo, and I think this is why there are people who don't like it or take issue with it.

You have SIGSTOP/SIGCONT/SIGKILL, which don't even really signal the process, they just do process control (suspend, resume, kill).

You have simple async messages (SIGHUP, SIGUSR1, SIGUSR2, SIGTTIN, SIGTTOU, etc) that get abused for reloading configuration/etc (with hacky workarounds like nohup for daemonization) or other stuff (gunicorn for example uses the latter 2 for scaling up and down dynamically). There's also in this category bizarrely specific things like SIGWINCH.

You also have SIGILL, SIGSEGV, SIGFPE, etc for illegal instructions, segmentation violations, FP exceptions, etc.

And also things that might not even be good to have as async things in the first place (SIGSYS).
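
The process-control group is easy to see in a small sketch, assuming POSIX `fork`, `kill` and `waitpid` (the helper name `stop_then_kill_child` is made up). The child installs no handlers and runs no signal-related code at all, yet the parent can suspend and then terminate it:

```c
#define _XOPEN_SOURCE 700
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Hypothetical helper: forks a child that only sleeps in pause(), suspends it
 * with SIGSTOP, then terminates it with SIGKILL.
 * Returns 1 if the kernel reported both the stop and the kill,
 * 0 if either report was missing, -1 if fork() failed.
 */
int stop_then_kill_child(void)
{
    pid_t child = fork();
    if (child == -1)
        return -1;
    if (child == 0) {
        for (;;)
            pause();          /* child: never handles or inspects any signal */
    }

    int status = 0;
    int stopped = 0, killed = 0;

    /* Suspend: WUNTRACED makes waitpid() report stopped children too. */
    kill(child, SIGSTOP);
    if (waitpid(child, &status, WUNTRACED) == child)
        stopped = WIFSTOPPED(status) && WSTOPSIG(status) == SIGSTOP;

    /* Terminate: SIGKILL takes effect even on a stopped process. */
    kill(child, SIGKILL);
    if (waitpid(child, &status, 0) == child)
        killed = WIFSIGNALED(status) && WTERMSIG(status) == SIGKILL;

    return stopped && killed;
}
```

Everything here happens between the parent and the kernel; the "signalled" process is just the object being operated on.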

---

As an aside, it's not the only approach and there's definitely tradeoffs with the other approaches.

Windows has events, SEH (access violations, other exceptions), handler routines (CTRL+C/CTRL+BREAK/shutdown,etc), and IOCPs (async I/O), callbacks, and probably some other things I'm forgetting at the moment.

Plan 9 has notes which are strings... which lets you send arbitrary data to another process which is neat, but it using the same mechanism for process control imo has the same drawbacks as *nix except now they're strings instead of a single well-defined number.

jclulow

2 replies

16h36m

2024-05-25 01:51:09 UTC

The Windows mechanisms you're mentioning were also added over the course of many, many years. Much of Windows also happened a long time after UNIX signals were invented.

If you're including all that other stuff, it's probably fair to include all of the subsequent development of notification mechanisms on the UNIX side of the fence as well; e.g., poll(2), various SVR4 IPC primitives, event ports in illumos, kqueue in FreeBSD, epoll and eventually io_uring in Linux.

jkrejcha

0 replies

16h0m

2024-05-25 02:26:26 UTC

Yeah, it definitely is (especially since SIGIO is a thing :)). Even the Unix signals had more added to them over time (SIGWINCH and friends iirc came from the BSDs).

A lot of the mechanisms are very OS specific but I do think they're good comparisons to have with signals as well.

flykespice

0 replies

15h49m

2024-05-25 02:38:00 UTC

Except many of these later UNIX developments were done by its derivatives and are often available with a certain degree of incompatibility among them (or not at all).

chasil

2 replies

23h46m

2024-05-24 18:41:07 UTC

There were differences between BSD and SYSV signal handling that were problematic in writing portable applications.

https://pubs.opengroup.org/onlinepubs/009604499/functions/bs...

It's important to remember that code in a signal handler must be reentrant. "Nonreentrant functions are generally unsafe to call from a signal handler."

https://man7.org/linux/man-pages/man7/signal-safety.7.html
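
In practice this usually means keeping the handler tiny. A minimal sketch, assuming POSIX (the names `on_usr1` and `raise_and_check_flag` are made up): the handler only sets a `volatile sig_atomic_t` flag and calls `write(2)`, which is on the async-signal-safe list, instead of something like `printf`.

```c
#define _XOPEN_SOURCE 700
#include <errno.h>
#include <signal.h>
#include <string.h>
#include <unistd.h>

/* The only state the handler touches: safe to write from a handler. */
static volatile sig_atomic_t got_signal = 0;

static void on_usr1(int signo)
{
    (void)signo;
    static const char msg[] = "caught SIGUSR1\n";
    int saved_errno = errno;       /* write() may clobber errno */

    got_signal = 1;
    ssize_t n = write(STDERR_FILENO, msg, sizeof msg - 1);
    (void)n;                       /* nothing useful to do on failure here */

    errno = saved_errno;
}

/*
 * Hypothetical helper: installs the handler, raises SIGUSR1 at ourselves and
 * returns the flag (1 if the handler ran), or -1 if sigaction() failed.
 */
int raise_and_check_flag(void)
{
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_usr1;
    sigemptyset(&sa.sa_mask);
    if (sigaction(SIGUSR1, &sa, NULL) == -1)
        return -1;

    got_signal = 0;
    raise(SIGUSR1);                /* returns only after the handler has run */
    return got_signal;
}
```

The real work (reloading config, logging with stdio, freeing memory) then happens in the main loop once it notices the flag.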

convolvatron

1 replies

22h59m

2024-05-24 19:27:59 UTC

reentrancy is not sufficient here - at least that provided by mutex style exclusion. the interrupted thread may have actually been the one holding the lock, so if the signal handler enters a queue to wait for it, it may be waiting quite a while
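
A toy sketch of that scenario, assuming POSIX signals and using a plain flag as a stand-in for a real lock (all names here are made up for illustration). The handler runs on top of the very code that holds the "lock", so it only records what it sees instead of waiting:

```c
#define _XOPEN_SOURCE 700
#include <signal.h>
#include <string.h>

/* Stand-in for a lock inside some library (think of malloc's internal lock). */
static volatile sig_atomic_t lock_held = 0;
static volatile sig_atomic_t handler_saw_lock_held = 0;

static void on_usr1(int signo)
{
    (void)signo;
    /* A handler that tried to acquire the lock here would wait forever:
     * the only code able to release it is the code this handler interrupted.
     * This sketch just records what it found. */
    handler_saw_lock_held = lock_held;
}

/*
 * Hypothetical helper: takes the "lock", gets interrupted by a signal while
 * holding it, then releases it. Returns 1 if the handler observed the lock
 * as held, 0 if not, -1 if sigaction() failed.
 */
int interrupt_while_locked(void)
{
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_usr1;
    sigemptyset(&sa.sa_mask);
    if (sigaction(SIGUSR1, &sa, NULL) == -1)
        return -1;

    lock_held = 1;      /* enter the "critical section" */
    raise(SIGUSR1);     /* the signal lands in the same thread, mid-section */
    lock_held = 0;      /* leave it; the handler has already run by now */

    return handler_saw_lock_held;
}
```

Swap the flag for a real mutex and the recording for a blocking acquire, and the handler never returns, which is the hang described above.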

tedunangst

0 replies

22h9m

2024-05-24 20:17:50 UTC

That's why the word reentrant is used, not thread safe.

NikkiA

1 replies

2024-05-24 18:03:42 UTC

I always felt VMS' mailbox system was much more elegant, but I imagine it's an ugly mess under the surface too.

https://wiki.vmssoftware.com/Mailbox

MisterTea

0 replies

2024-05-24 18:27:06 UTC

I like Plan 9's notes: http://man.postnix.pw/9front/2/notify

palata

0 replies

2024-05-24 17:34:34 UTC

I wanted to say the exact same thing! I would love to get more details about that.

eterps

0 replies

2024-05-24 17:50:40 UTC

Would love to read a blog post about that.

samatman

19 replies

1d1h

2024-05-24 16:37:18 UTC

I was interested in Hare until I found this immensely self-defeating FAQ item: https://harelang.org/documentation/faq.html#will-hare-suppor...

As a baseline, I support developers using whatever license they would like, and targeting whatever operating systems, indeed, writing whatever code they would like in the process.

That doesn't make this specific policy a good idea. Even FSF, generally considered the most extreme (or, if you prefer, principled) exponents of the Free Software philosophy, support Windows and POSIX. They may grumble and call it Woe32, but Stallman has said some cogent things about how the fight for a world free of proprietary software is more readily advanced by making sure that Free Software projects run on proprietary systems.

They do at least license the library code under MPL, so merely using Hare doesn't lock you into a license. But I wonder about the longevity of a language where the attitude toward 95+% of the desktop is "unsupported, don't ask questions on our forums, we don't want you here".

Ironically, a Google search for "harelang repo" has as the first hit an unofficial macOS port, and the actual SourceHut repo doesn't show up in the first page of results.

Languages either snowball or fizzle out. I'm typing this on a Mac, but I could pick up a Linux machine right now if I were of a mind to. But why would I invest in learning a language which imposes a purity test on developers, when even the FSF doesn't? A great deal of open source and free software gets written on Macs, and in fact, more than you might think on Windows as well.

From where I sit, what differentiates Hare from Odin and Zig, is just this attitude of purity and exclusion. I wish you all happy hacking, of course, and success. But I'm pessimistic about the latter.

palata

4 replies

2024-05-24 17:44:36 UTC

I don't think that Apple particularly cares about porting their software to Linux. Do you feel the same about Apple? That with such an attitude, they surely cannot succeed?

samatman

3 replies

2024-05-24 17:56:26 UTC

Apple releases a great deal of open source software, which, so far as I'm aware, all runs on Linux as well. At least Swift, clang, and LLVM, all run on Windows as well. So does their Objective C compiler, so of Apple's programming languages, that leaves AppleScript. I would not describe AppleScript as robustly successful.

I believe Apple could probably get away with keeping Swift proprietary, or only supporting Apple platforms. But they don't. I have no inside-track information on why that is, but I suspect the reason is fairly simple: developers wouldn't like it.

palata

1 replies

23h46m

2024-05-24 18:40:37 UTC

so of Apple's programming languages

So the whole part of your message about "even the FSF saying that free software should run on proprietary system" works when you want to criticize Hare, but not when looking at Apple proprietary software, right?

A language is just another piece of software, I don't see why you should apply different rules to a programming language than, e.g. to a serializing system like Protobuf. And I don't think Google actively supports swift-protobuf (https://github.com/apple/swift-protobuf).

Hare upstream just says "we are not interested in supporting non-free OSes, but we won't prevent you from doing it". It's your choice to not use Hare because of this, but it's their choice to not support macOS.

samatman

0 replies

19h45m

2024-05-24 22:42:15 UTC

As a baseline, I support developers using whatever license they would like, and targeting whatever operating systems, indeed, writing whatever code they would like in the process.

That doesn't make this specific policy a good idea.

saagarjha

0 replies

20h21m

2024-05-24 22:05:39 UTC

You will note that Apple invests approximately zero effort in making those projects portable.

sramsay

3 replies

2024-05-24 17:31:37 UTC

"We cannot effectively study, understand, debug, or improve, the underlying operating system if it is non-free. We actively work with the source code for the systems on which we depend, and we are not interested in supporting any platforms for which this is not possible."

I understand that you don't like it, but how do you come to regard a statement like this as "arbitrary?" It's exclusive, for sure. "Purity test" is one way to characterize it. But do you really think that statements like this are just the product of individual caprice? That it's not someone's attempt at a principled intervention, but just an "attitude?"

samatman

0 replies

2024-05-24 18:09:55 UTC

You're right, it isn't arbitrary. I removed that word from the post and edited it to express my opinion more clearly.

apantel

0 replies

21h44m

2024-05-24 20:42:28 UTC

I was going to post the same quote. If you have no visibility into the layer you depend on, you really can’t reason about it or write optimized code for it.

The Hares are saying they require that, which I totally understand and respect.

PhilipRoman

0 replies

2024-05-24 17:50:28 UTC

Ouch, I hadn't really considered it before but that quote deeply resonates with me. The experience of trying to debug the Windows wifi system is night and day compared to wpa_supplicant/mac80211.

kbolino

1 replies

1d1h

2024-05-24 17:04:38 UTC

On the one hand, I can respect the authors for sticking to what they want to accomplish and not accommodating every demand.

On the other hand, that is hardly the only thing from the FAQ that raises one's eyebrows:

we have no package manager and encourage less code reuse as a shared value

qbe generates slower code compared to LLVM, with the performance ranging from 25% to 75% the runtime performance of comparable LLVM-generated code

Can I use multithreading in Hare? Probably not.

So I need to implement hash tables myself? Indeed. Hash tables are a common data structure that many Hare programs will need to implement from scratch.

As it stands, this is definitely not a language designed for mass adoption. Which is fine, and at least they're upfront about it.

jay-barronville

0 replies

18h58m

2024-05-24 23:28:35 UTC

Some of those design decisions I’m okay with, but deliberately not providing a basic hash table for general usage is pretty bizarre. I can’t think of even one serious software project I’ve worked on that didn’t need a dictionary/map-like data structure somewhere in the code.
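
For a sense of what "implement from scratch" means at its smallest, here is a sketch in C rather than Hare, not taken from the Hare documentation: a fixed-capacity, open-addressing map from string keys to ints using the FNV-1a hash. All names (`struct map`, `map_put`, `map_get`) and the capacity of 64 are assumptions for illustration; there is no resizing or deletion, and keys are borrowed rather than copied.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define MAP_CAPACITY 64   /* fixed power of two; this sketch never resizes */

struct entry { const char *key; int value; };   /* key == NULL marks an empty slot */
struct map   { struct entry slots[MAP_CAPACITY]; size_t len; };

/* 32-bit FNV-1a hash of a NUL-terminated string. */
static uint32_t fnv1a(const char *s)
{
    uint32_t h = 2166136261u;
    for (; *s != '\0'; s++) {
        h ^= (unsigned char)*s;
        h *= 16777619u;
    }
    return h;
}

/* Linear probing: returns the slot holding key, or the empty slot where it
 * would be inserted, or NULL if the table is full and key is absent. */
static struct entry *map_slot(struct map *m, const char *key)
{
    size_t i = fnv1a(key) & (MAP_CAPACITY - 1);
    for (size_t probes = 0; probes < MAP_CAPACITY; probes++) {
        struct entry *e = &m->slots[i];
        if (e->key == NULL || strcmp(e->key, key) == 0)
            return e;
        i = (i + 1) & (MAP_CAPACITY - 1);
    }
    return NULL;
}

/* Inserts or overwrites. Returns 0 on success, -1 if the table is full. */
int map_put(struct map *m, const char *key, int value)
{
    struct entry *e = map_slot(m, key);
    if (e == NULL)
        return -1;
    if (e->key == NULL) {
        e->key = key;     /* borrowed pointer: the caller keeps the string alive */
        m->len++;
    }
    e->value = value;
    return 0;
}

/* Returns a pointer to the stored value, or NULL if key is absent. */
const int *map_get(struct map *m, const char *key)
{
    struct entry *e = map_slot(m, key);
    return (e != NULL && e->key != NULL) ? &e->value : NULL;
}
```

A zero-initialized `struct map` is an empty table. A production version would still need growth, deletion and key ownership, which is where most of the real effort goes.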

stonogo

0 replies

1d1h

2024-05-24 16:52:53 UTC

Sounds like you and the Hare people have different definitions of success. As for "languages either snowball or fizzle out," I feel like that's pretty dismissive of a lot of languages that have been steadily marching on for decades even without this rockstar status.

Not every band has to hit the Billboard charts to be worth listening to.

skydhash

0 replies

1d1h

2024-05-24 17:15:17 UTC

But why would I invest in learning a language which imposes an arbitrary purity test on developers?

While I understand your concerns, I disagree with the idea of “imposition”. Someone doing something for free doesn’t owe it to anyone to do it in a particular way (as long as it’s not malevolent). You’re free to express your opinion, but if the developer has already established his guidelines, criticism like this is not constructive.

sakras

0 replies

16h14m

2024-05-25 02:12:37 UTC

My real showstopper with Hare is the lack of multithreading. In the modern world, we need to be making parallelism easier not harder!

jampekka

0 replies

1d1h

2024-05-24 16:56:18 UTC

"The goal of Hare is not to achieve the broadest possible reach, but to be a part of a broader system which effectively achieves Hare’s goals."

cardanome

0 replies

18h48m

2024-05-24 23:39:06 UTC

I think focusing on Linux makes sense for limiting the scope of the project. Supporting Mac sucks when you own no Apple hardware and have no personal interest in the ecosystem. Windows users probably can just use WSL, right? Or I mean, people use docker these days anyway.

So I get it. Especially if it is to be a more niche or pet project but then again I don't buy the ideological reason. I am a really big proponent of free software and their stance just doesn't make any sense. I agree with you here. But then again they can do whatever they want.

bee_rider

0 replies

1d1h

2024-05-24 16:59:35 UTC

It says they won’t officially support Windows or MacOS. Some other project can try to port it if they want, right? It seems good of them to be honest about their intended level of support.

Supporting an OS the devs don’t use is a big ask.

WhyNotHugo

0 replies

17h23m

2024-05-25 01:03:36 UTC

There's no purity test and the Hare devs aren't prohibiting you from using Hare on macOS or any other platform.

They just don't want to maintain Mac/Windows ports themselves. If somebody else is interested, they can maintain a port. Like that macOS one that you've already found.

2pEXgD0fZ5cF

0 replies

22h17m

2024-05-24 20:09:45 UTC

Languages either snowball or fizzle out.

This is not true and a naive statement. There are quite a few languages which are not popular across the board but have a very firm niche in which they thrive and fulfill critical roles.

mtillman

15 replies

1d1h

2024-05-24 17:01:57 UTC

This is really cool. Reminds me of how the original Unix was invented in a couple of weeks while Ritchie's family went on vacation to CA to visit his in-laws.

Source: UNIX: A History and a Memoir by Brian W. Kernighan (2019)

balder1991

7 replies

1d1h

2024-05-24 17:11:31 UTC

But I think it’s relevant to say that before writing Unix he was working on Multics for a long time already. Unix was a “simplified” version of it, if I remember correctly. So it didn’t “spring out of thin air.”

teleforce

2 replies

30m

2024-05-25 17:56:36 UTC

Unix was a kind of wordplay on "unique", as an antithesis to Multics, which was originally designed as a modern multi-user, multi-process OS. Ironically, like any real-world OS, Unix eventually became a multi-user system similar to Multics, but the name stuck. Granted, Unix has a very simple (as in as simple as possible, but no simpler) multi-user permission and security system that has worked reliably for many decades. Of all organizations, the NSA actually came up with a better replacement for the modern Unix permission and security model in SELinux, but most users just ignore and disable SELinux even though it is installed by default by many major Linux distros [1].

[1] SELinux is unmanageable; just turn it off if it gets in your way:

https://news.ycombinator.com/item?id=31176138

pjmlp

0 replies

2024-05-25 18:19:58 UTC

Not something that you can disable on Android, or on properly managed Linux servers, where devs only get what they should touch on.

JoeAltmaier

0 replies

28m

2024-05-25 17:59:11 UTC

I understood it was Unix for 'one mechanism' or 'unified' instead of the broad everything-but-the-kitchen-sink Multics approach. That was the joke I understood. Nothing about single-user.

trollerator23

0 replies

23h33m

2024-05-24 18:53:34 UTC

Absolutely.

fuzztester

0 replies

20h45m

2024-05-24 21:41:24 UTC

So it didn’t “spring out of thin air.”

Right. Almost nothing does.

You see, it's https://en.m.wikipedia.org/wiki/Turtles_all_the_way_down

eichin

0 replies

20h34m

2024-05-24 21:53:14 UTC

Mmm, even early versions ended up being more the "anti-multics" than actually simplified-from, despite the name pun...

DiggyJohnson

0 replies

12h52m

2024-05-25 05:35:09 UTC

Where did the quoted text come from? Something might have gotten edited.

naitgacem

1 replies

2024-05-24 18:13:38 UTC

i thought that story was about 3 programs that were missing, a text editor being one of them.

I'll have to check because my memory is failing me atm.

Zambyte

0 replies

2024-05-24 18:17:56 UTC

I allocated a week each to the operating system, the shell, the editor and the assembler

http://www.groklaw.net/article.php?story=20050414215646742

laxd

1 replies

23h2m

2024-05-24 19:24:23 UTC

I think you mean Ken Thompson. I can't be bothered searching through YouTube interviews, but I'm pretty sure that on more than one occasion he tells a story along the lines of having a disk driver, some programs, and maybe some other components. His wife went on a trip and he figured it would be enough time to fill in the gaps and make a complete OS.

jasone

0 replies

22h6m

2024-05-24 20:20:20 UTC

I'm pretty sure that is mentioned in this interview:

https://www.youtube.com/watch?v=wqI7MrtxPnk

By the way the CHM oral history video series is full of gems.

dboreham

1 replies

18h27m

2024-05-24 23:59:47 UTC

But Unix itself took many years to write (if you count V7 as "properly finished Unix"). The first version was only a filesystem, for example.

lproven

0 replies

7h51m

2024-05-25 10:36:03 UTC

So was MS-DOS. Sold in the tens of millions and kick-started the entire x86 PC industry, though.

Sometimes small and simple is good.

AlexeyBrin

0 replies

7h39m

2024-05-25 10:47:30 UTC

I think you are confusing Dennis Ritchie with Ken Thompson.

andsoitis

13 replies

1d2h

2024-05-24 16:15:32 UTC

Impressive, super cool, and inspiring!

An example of how “creating something impressive in X days” requires a lot of experience and talent that is built over years.

beryilma

3 replies

22h42m

2024-05-24 19:44:21 UTC

Versus now... I changed the text on a button with an internationalized string. It only took me about a week.

I put the English string in the catalog, updated a number of tests, ran the tests on the local system, pushed the change to the staging cluster, fixed unanticipated test failures, pushed the change to production, contacted the translators to have the string translated into a number of languages, and had the documentation updated.

Muromec

1 replies

22h23m

2024-05-24 20:03:53 UTC

So... It goes to production before you get translations to all the languages?

beryilma

0 replies

20h33m

2024-05-24 21:53:44 UTC

In my case, the "production" does not really become visible to users right away. Perhaps, I should have called it "pre-production".

lupire

0 replies

3h9m

2024-05-25 15:17:41 UTC

I suggest you use translation management tools, so the translator gets the string as soon as you add it to the catalog.

But anyway there's no "then vs now" when you are really comparing "prototype" to "deliver to users". It took Unix decades to get those strings translated.

saagarjha

2 replies

20h25m

2024-05-24 22:02:12 UTC

Drew is smart and his timeline is short but I think it’s the wrong way to look at it if you just put him on a pedestal for it. Making a UNIX clone is a typical undergrad project at most universities. Extending that to something that is complete is something that requires perseverance, not special genius.

smugma

0 replies

3h16m

2024-05-25 15:10:17 UTC

NachOS was developed at Berkeley and maintained at UW. Both are top-ranked CS programs. Undergraduates are expected to add features to the core OS e.g. virtual memory, not build it from scratch.

https://en.wikipedia.org/wiki/Not_Another_Completely_Heurist...

https://homes.cs.washington.edu/~tom/nachos/

bjoli

0 replies

19h22m

2024-05-24 23:04:42 UTC

I think it is a matter of how you are exposed to programming. I started with pascal at 9, and I wrote my first (VM-)bootable OS in junior high school (around the age of 14). Not as fancy as this of course, but it booted into an environment not unlike r4rs scheme - based on SIOD. A scheme error was handled but any C errors would immediately lead to a kernel panic.

I am not a programmer today, but I can still wrap most of my head around many low level concepts. I can't, however, write anything resembling a modern web page. Nor can I understand how any larger JS application works.

pushedx

2 replies

1d1h

2024-05-24 17:01:46 UTC

Also the creator of KnightOS, written entirely in Z80 assembly, more than 12 years ago!

https://www.ticalc.org/archives/files/fileinfo/463/46387.htm...

ruined

0 replies

23h32m

2024-05-24 18:54:56 UTC

holy shit. they're already a living legend but somehow i didn't make this connection

lupire

0 replies

3h35m

2024-05-25 14:52:14 UTC

https://drewdevault.com/2020/01/27/KnightOS-was-interesting....

Sadly defunct. I guess the real OS was the syscalls we made along the way.

PaulDavisThe1st

1 replies

1d2h

2024-05-24 16:26:57 UTC

... and also a previous kernel implementation called Helios to provide a lot of the lowest level code. Not trying to knock down the accomplishment, but DD is pretty open about the fact that a lot of the speed of this project was dependent on having done Helios first (and reusing code from it).

palata

0 replies

2024-05-24 17:35:38 UTC

...which is part of the "experience and talent built over years", I guess? :-)

ezconnect

0 replies

20h14m

2024-05-24 22:13:00 UTC

He already had Helios, so he just integrated a few missing parts.

anta40

8 replies

1d1h

2024-05-24 16:44:47 UTC

Very cool. Most of these Unix clones are usually written in C. This one is written in a new programming language.

pjmlp

3 replies

2024-05-24 18:13:29 UTC

There were UNIXes written in Ada and Pascal, though naturally C has a special relationship.

trollerator23

2 replies

23h32m

2024-05-24 18:54:49 UTC

Really??

pjmlp

0 replies

23h7m

2024-05-24 19:20:09 UTC

Yep,

https://en.m.wikipedia.org/wiki/Apollo_Computer

https://marte.unican.es/

JPLeRouzic

0 replies

23h15m

2024-05-24 19:11:53 UTC

In France, in the 1980s, there was a Unix clone written in Pascal by the CNET (the R&D arm of the incumbent phone operator). The CPU was an M68K and the hard disk had 20 MB (if I recall correctly). I don't remember the name of the beast.

balder1991

3 replies

2024-05-24 17:30:08 UTC

I only read part of the FAQ. I find the desire to keep the complexity low by limiting the compiler lines of code and not using LLVM interesting, but I wonder how practical it is. The FAQ admits that because of this, it generates slower code. So it shifts the complexity to the software codebase, by telling the users to “use assembly where needed”.

Seems a bit like Python’s philosophy of not introducing too many optimizations to prevent the runtime complexity from spiraling out of control.

PhilipRoman

1 replies

2024-05-24 17:41:42 UTC

I doubt it is a real problem for anything other than number crunching. I like to use tcc during development (which does very few, if any, optimizations) to speed up compilation and I never noticed any regressions in performance, even for GUI software. Throughput just isn't that big of a deal for most applications (although latency and resource usage are, but those aren't affected by choice of compiler).

fuzztester

0 replies

2h48m

2024-05-25 15:38:50 UTC

You are using C (with TCC) for GUI apps? with what GUI framework or library?

cardanome

0 replies

19h13m

2024-05-24 23:14:07 UTC

From what I heard LLVM seems to be not very great at keeping backwards compatibility and makes no guarantees that the IR (intermediate representation) stays the same. So I imagine it can be frustrating to have a moving target.

Plus it is a heavy dependency which means projects like writing a self-hosting OS in a month are much less realistic to achieve when your compiler relies on LLVM.

And not least, the code generation is pretty slow. If your language cares greatly about compile speed, which it should, this is a bummer.

So yeah, for many projects avoiding LLVM might be a good idea.

amelius

8 replies

2024-05-24 17:39:56 UTC

Waiting for an OS that treats GPU(s) as a first class citizen ...

eterps

3 replies

2024-05-24 17:56:07 UTC

That wouldn't be too hard if GPUs had a stable interface. Try programming a GPU in Assembly language and see how that goes. The experience sucks, but that's the level that needs to be targeted in case of an OS.

eterps

2 replies

2024-05-24 18:04:36 UTC

For example, in the past Amiga computers had a 'GPU' (although much less powerful than today's GPUs) with a stable interface. It was a first class citizen in its OS. It also was incredibly easy to target in Assembly language.

pjmlp

0 replies

2024-05-24 18:10:32 UTC

Blitter was great, but those were simpler times.

The best we have nowadays is using compute shaders for the same purpose.

Just like when using a TMS34010 with its C SDK.

mepian

0 replies

20h57m

2024-05-24 21:29:38 UTC

Amiga died because it was stuck with the same old "GPU" for too long, among other reasons.

sph

1 replies

11h41m

2024-05-25 06:45:51 UTC

If programming GPU drivers was not something only a handful of employees with NVIDIA or AMD badges could do (because of NDAs, non-public documentation and immense complexity), somebody would have tried.

amelius

0 replies

6h2m

2024-05-25 12:24:17 UTC

The point (for some) of writing your own OS is that you do something only a handful of people can do ...

saagarjha

0 replies

20h24m

2024-05-24 22:03:16 UTC

What should this OS do?

bobmcnamara

0 replies

4h45m

2024-05-25 13:41:18 UTC

Are you thinking something like Plan9 instances, cloud-in-a-box style, or something else?

nickcw

3 replies

1d1h

2024-05-24 16:53:00 UTC

Hare looks like an interesting language.

Though this limitation will limit its adoption in this multicore age I think:

From the FAQ https://harelang.org/documentation/faq.html

....

Can I use multithreading in Hare?

Probably not.

We prefer to encourage the use of event loops (see unix::poll or hare-ev) for multiplexing I/O operations, or multiprocessing with shared memory if you need to use CPU resources in parallel.

It is, strictly speaking, possible to create threads in a Hare program. You can link to libc and use pthreads, or you can use the clone(2) syscall directly. Operating systems implemented in Hare, such as Helios, often implement multi-threading.

However, the upstream standard library does not make reentrancy guarantees, so you are solely responsible for not shooting your foot off.
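To make the event-loop suggestion in that FAQ answer a bit more concrete, here is a minimal sketch in C using poll(2) directly rather than Hare's unix::poll, whose exact API I am not reproducing here. The helper name `drain_all` and the 16-descriptor limit are made up for illustration; the idea is just that one thread waits on several descriptors at once and services whichever is ready.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <errno.h>
#include <poll.h>
#include <unistd.h>

/* Hypothetical helper (not from Hare or the FAQ): read from every
 * descriptor in `fds` until each one reaches EOF, using a single
 * poll(2) loop. Returns the total number of bytes read, or -1 if
 * poll itself fails. */
long drain_all(const int *fds, int nfds)
{
    struct pollfd pfds[16];
    char buf[4096];
    long total = 0;
    int open_count = nfds;

    assert(nfds >= 0 && nfds <= 16);
    for (int i = 0; i < nfds; i++) {
        pfds[i].fd = fds[i];
        pfds[i].events = POLLIN;
        pfds[i].revents = 0;
    }

    while (open_count > 0) {
        /* Block until at least one descriptor has something to report. */
        if (poll(pfds, (nfds_t)nfds, -1) < 0) {
            if (errno == EINTR)
                continue;
            return -1;
        }
        for (int i = 0; i < nfds; i++) {
            if (!(pfds[i].revents & (POLLIN | POLLHUP | POLLERR | POLLNVAL)))
                continue;
            ssize_t n = read(pfds[i].fd, buf, sizeof buf);
            if (n > 0) {
                total += n;
            } else if (n < 0 && errno == EINTR) {
                continue;               /* interrupted, retry on next pass */
            } else {
                pfds[i].fd = -1;        /* poll ignores negative descriptors */
                open_count--;
            }
        }
    }
    return total;
}
```

Called with the read ends of a few pipes or sockets, this services all of them from one thread, so there is no shared mutable state to protect and no reentrancy question for the standard library.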

senkora

1 replies

1d1h

2024-05-24 17:22:35 UTC

multiprocessing with shared memory if you need to use CPU resources in parallel

This is actually pretty powerful. I personally prefer it for most purposes, because it restricts the possibility of data races to only the shared memory regions. It's a little like an "unsafe block" of memory with respect to data races.
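A rough sketch of what I mean, in C rather than Hare: the workers are separate processes created with fork(2), and the only memory they have in common is one explicit MAP_SHARED mapping. The helper name `parallel_count` is made up for this example, and it assumes `atomic_long` is lock-free on the target (true on mainstream platforms) so that it is safe to use across processes.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: fork `nworkers` child processes that each add
 * `per_worker` increments to one shared counter. Returns the final
 * total, or -1 if the mapping or any fork failed. */
long parallel_count(int nworkers, long per_worker)
{
    assert(nworkers >= 0 && per_worker >= 0);

    /* The one region every process can see. Everything else (stack,
     * heap, globals) becomes private to each child after fork(). */
    atomic_long *counter = mmap(NULL, sizeof *counter,
                                PROT_READ | PROT_WRITE,
                                MAP_SHARED | MAP_ANONYMOUS, -1, 0);
    if (counter == MAP_FAILED)
        return -1;
    atomic_init(counter, 0);

    int started = 0;
    for (int i = 0; i < nworkers; i++) {
        pid_t pid = fork();
        if (pid < 0)
            break;
        if (pid == 0) {
            /* Child: touch only the shared region, then exit at once. */
            for (long j = 0; j < per_worker; j++)
                atomic_fetch_add(counter, 1);
            _exit(0);
        }
        started++;
    }

    /* Parent: reap every child that was actually started. */
    for (int i = 0; i < started; i++)
        wait(NULL);

    long total = (started == nworkers) ? atomic_load(counter) : -1;
    munmap(counter, sizeof *counter);
    return total;
}
```

Calling `parallel_count(4, 1000)` should give 4000. The point is that the counter is the only place where synchronization has to be thought about at all; a bug anywhere else in a worker cannot corrupt another worker's state.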

pjmlp

0 replies

22h17m

2024-05-24 20:09:26 UTC

I used to be a strong believer in threads and dynamic-library plugins, and changed my mind exactly because of attack vectors and host program stability.

packetlost

0 replies

21h56m

2024-05-24 20:30:29 UTC

I just wish it had closures

lupusreal

2 replies

1d2h

2024-05-24 16:21:30 UTC

Missed opportunity to call it Drewnix.

jrpelkonen

1 replies

2024-05-24 17:55:24 UTC

Fun fact: Linus Torvalds originally named his fledgling OS “Freax”, but it was an FTP site admin who came up with “Linux” and the rest is history. So perhaps the opportunity is not completely missed…

davisr

0 replies

20h7m

2024-05-24 22:19:29 UTC

No he didn't.

"I called it Linux originally as a working name. That was just because "Linus" and the X has to be there--it's UNIX, it's like, a law--and what happened was that I initially thought that I can't call it "Linux" publicly because it's just too egotistical. That was before I had a big ego."

https://yewtu.be/watch?v=kZlOCHYu1Vk

westurner

1 replies

18h33m

2024-05-24 23:53:22 UTC

From "Linux System Call Table – Chromiumos" https://www.chromium.org/chromium-os/developer-library/refer... https://news.ycombinator.com/item?id=33395777 :

google/syzkaller

Fuchsia / Zircon syscalls: https://fuchsia.dev/fuchsia-src/reference/syscalls

westurner

0 replies

5h52m

2024-05-25 12:35:16 UTC

And a new one, a new syscall this year: mseal()

"Memory Sealing "Mseal" System Call Merged for Linux 6.10" https://news.ycombinator.com/context?id=40474551

thefaux

1 replies

23h43m

2024-05-24 18:44:07 UTC

Impressive work but I feel this approach is the hard and brittle way to write an os. The easier and more portable way is to write the os as a guest in a host language. You start with a simple shell with the print command and build from there.

palata

0 replies

23h38m

2024-05-24 18:48:23 UTC

I hope it's not too easy then... imagine what he could do in 27 days if this was the "hard and brittle way" :-).

LightFog

1 replies

1d1h

2024-05-24 17:07:30 UTC

It was really cool watching the ~daily updates on this on Mastodon - seeing how someone so skilled gradually pieces together a complex piece of software.

herodoturtle

0 replies

1d1h

2024-05-24 17:16:14 UTC

Link to the mastodon thread (from Drew’s article), for those that are interested:

https://fosstodon.org/@drewdevault/112319697309218275

userbinator

0 replies

15h20m

2024-05-25 03:06:22 UTC

The userspace is largely assembled from third-party sources.

That answered my initial surprise of clicking on the ISO and getting a 60MB download.

For comparison, Linux 0.01 was a 71k download, but contained only the kernel source.

pjmlp

0 replies

1d1h

2024-05-24 16:33:44 UTC

Quite cool, by making use of Hare instead.

calvinmorrison

0 replies

2024-05-24 18:24:25 UTC

hey drew! did writing this project give you any Hare-y situations you hadn't run into before, or maybe - reached into corners not yet probed by Hare and gave you ideas for a new feature or edge case that was scary?

AtlasBarfed

0 replies

18h42m

2024-05-24 23:44:36 UTC

Are there "waypoint" commits for major milestones? I'd really like to see those.

Like PC bootstrap, basic kernel action loops, process forking, yada yada

8organicbits

0 replies

1d1h

2024-05-24 16:28:48 UTC

Code is here: https://git.sr.ht/~sircmpwn/bunnix/tree/master

GPLv3 license.