From tuhs at tuhs.org Sat Mar 1 01:03:39 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Fri, 28 Feb 2025 10:03:39 -0500 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> Message-ID: On 2/28/25 12:08 AM, Greg A. Woods wrote: > Some systems, e.g. macOS (nee OS X) somewhat solved the problem by > simply adding more swap space dynamically by creating files in the root > filesystem (until the disk fills). I think this is one of the things macOS inherited from NeXT. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From imp at bsdimp.com Sat Mar 1 01:32:16 2025 From: imp at bsdimp.com (Warner Losh) Date: Fri, 28 Feb 2025 08:32:16 -0700 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> Message-ID: On Thu, Feb 27, 2025, 10:08 PM Greg A. Woods wrote: > At Thu, 27 Feb 2025 17:55:49 -0500, Henry Bent > wrote: > Subject: [TUHS] Re: Having dickens of a time compiling gcc-4.4.7 on tru64 . > > > > On Thu, Feb 27, 2025, 17:07 Phil Budne wrote: > > > > > Arrigo Triulzi wrote: > > > > > > My recall is that 4.2BSD would only give you (writable) memory it > > > could allocate swap space for. > > > > > > I think of the penchant for "overcommitting" memory > > > (and the cold hand of the OOM killer) as Linuxisms > > I'm pretty sure over-commit in virtual memory systems is older than > linux, and of course it's common in the BSDs now too. > I switched from early Linux to FreeBSD because Linux lacked overcommit at the time (or it was so buggy you had to disable it). I thought this was a 4.4bsd thing, but FreeBSD's VM saw heavy work for big workloads right away... VMS 4 or 5 allowed it, with enough tuning. Warner Windows-NT did it, as did early AIX. IBM'S VM does it, as do many other > virtual machine supervisors. Unix System V may even have done it. I > don't know where it originated, and I couldn't find a quick reference > online just now to give a history of it, but likely someone here does. > > AIX also had an "OOM" killer (but I think it was the paging system > itself, no a user-level process) -- which in early releases (the first > version 3 on RS/6000 in 1989 or 1990) just tromped on the biggest > process, which was usually the Xserver. I seem to remember the system > rebooting if X died too. Even earlier (for the RT) IBM had invented > SIGDANGER to warn of a pending OOM condition. Big users of memory could > catch SIGDANGER and know they had to release some memory if they could. > I think catchers of SIGDANGER would avoid the early SIGKILLs too, but I > don't believe the Xserver tried to catch SIGDANGER, at least not in > early releases. > > Some systems, e.g. macOS (nee OS X) somewhat solved the problem by > simply adding more swap space dynamically by creating files in the root > filesystem (until the disk fills). > > -- > Greg A. Woods > > Kelowna, BC +1 250 762-7675 RoboHack > Planix, Inc. Avoncote Farms > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stewart at serissa.com Sat Mar 1 01:57:47 2025 From: stewart at serissa.com (Lawrence Stewart) Date: Fri, 28 Feb 2025 10:57:47 -0500 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> Message-ID: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> I’m probably a lost soul on this issue, but swap space is just a way to turn program bugs into performance problems. In HPC one says “real programs need real memory”. At SiCortex we ran 972 node cluster machines without any swap space (4 or 8 GB per node) and it worked fine. Of course we didn’t have any disks either, so we made a virtue of necessity. It is perfectly true that the OOM killer was feared and hated, but only because it couldn’t identify the actual bad apple. I realize this attitude only works when you (pretty much) dedicate a node to running a single program at a time, but that is how most HPC systems of the time worked. -L From drb at msu.edu Sat Mar 1 02:04:49 2025 From: drb at msu.edu (Dennis Boone) Date: Fri, 28 Feb 2025 11:04:49 -0500 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: (Your message of Fri, 28 Feb 2025 10:57:47 -0500.) <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> References: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> Message-ID: <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> > I’m probably a lost soul on this issue, but swap space is just a way > to turn program bugs into performance problems. You're hardly the only one. Some years ago, running Linux web and database servers, I quit creating swap space. A runaway program would turn the system into an infinite game of shuffle-the-pages well before the OOM killer actually decided to kill something, and in that state, one couldn't even reboot. This expanded the time window of "broken" from tens of seconds, and perhaps a service restart, into tens of minutes and a power button recovery. Every #$%^&* time. De From tuhs at tuhs.org Sat Mar 1 02:12:19 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Fri, 28 Feb 2025 16:12:19 +0000 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> References: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> Message-ID: On Friday, February 28th, 2025 at 8:04 AM, Dennis Boone wrote: > > I’m probably a lost soul on this issue, but swap space is just a way > > > to turn program bugs into performance problems. > > > You're hardly the only one. Some years ago, running Linux web and > database servers, I quit creating swap space. A runaway program would > turn the system into an infinite game of shuffle-the-pages well before > the OOM killer actually decided to kill something, and in that state, > one couldn't even reboot. This expanded the time window of "broken" > from tens of seconds, and perhaps a service restart, into tens of > minutes and a power button recovery. Every #$%^&* time. > > De I've read several bits of guidance lately suggesting avoiding swap due to the increasing prevalence of solid-state memories. The assertion is that I/O heavy swapping, especially if you get into a thrashing state, is liable to age current storage technologies much more than it would have in the platter disk era. I've heard contrary opinions that it isn't as large of a liability in reality. I haven't settled on one or the other, I keep a swap file around on the microSD that runs my RPi, but I've only needed to swapon, like OP, when compiling gcc. - Matt G. From imp at bsdimp.com Sat Mar 1 02:20:20 2025 From: imp at bsdimp.com (Warner Losh) Date: Fri, 28 Feb 2025 09:20:20 -0700 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> Message-ID: On Fri, Feb 28, 2025 at 9:12 AM segaloco via TUHS wrote: > On Friday, February 28th, 2025 at 8:04 AM, Dennis Boone > wrote: > > > > I’m probably a lost soul on this issue, but swap space is just a way > > > > > to turn program bugs into performance problems. > > > > > > You're hardly the only one. Some years ago, running Linux web and > > database servers, I quit creating swap space. A runaway program would > > turn the system into an infinite game of shuffle-the-pages well before > > the OOM killer actually decided to kill something, and in that state, > > one couldn't even reboot. This expanded the time window of "broken" > > from tens of seconds, and perhaps a service restart, into tens of > > minutes and a power button recovery. Every #$%^&* time. > > > > De > > I've read several bits of guidance lately suggesting avoiding swap due to > the increasing prevalence of solid-state memories. The assertion is that > I/O heavy swapping, especially if you get into a thrashing state, is liable > to age current storage technologies much more than it would have in the > platter disk era. I've heard contrary opinions that it isn't as large of a > liability in reality. I haven't settled on one or the other, I keep a swap > file around on the microSD that runs my RPi, but I've only needed to > swapon, like OP, when compiling gcc. > Swapping to flash can be fine. The days when a "spot" could be worn out are 20 years in the past. If you use it as a "shock absorber" to cope with transient loads, there's no issues. However a lot depends on which kind of flash you have, you might have trouble if the load is constant. Today there's a wide range of flash drives. Ranging from < .1DWPD drives (you can't write even 10% of the drive's capacity in one day w/o wearing it out in the warranty period) to 3, 5 or 10 DWPD drives. 0.3 and 1.0 seem popular in the consumer / prosumer / low-end-datacenter space. Those can be a problem if you are constantly swapping (but again are fine for coping with transients). The MicroSD is likely to be closer to the .1-.3 DWPD (unless you bought an expensive one). Warner -------------- next part -------------- An HTML attachment was scrubbed... URL: From pugs78 at gmail.com Sat Mar 1 02:42:32 2025 From: pugs78 at gmail.com (Tom Lyon) Date: Fri, 28 Feb 2025 08:42:32 -0800 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> Message-ID: The counterpoint to "just say no to swapping" is that sometimes you really, really, want a job to proceed even if at a glacial pace. Some years ago I had to do a Linux kernel build on a 1GB(maybe less?) MIPS system, and even though it had oodles of swap space, the OOM killer kept biting. Turns out you have to tune "swappiness" in poorly documented ways to get Linux to actually use virtual memory. I think the build took about a week once it worked. On Fri, Feb 28, 2025 at 8:04 AM Dennis Boone wrote: > > I’m probably a lost soul on this issue, but swap space is just a way > > to turn program bugs into performance problems. > > You're hardly the only one. Some years ago, running Linux web and > database servers, I quit creating swap space. A runaway program would > turn the system into an infinite game of shuffle-the-pages well before > the OOM killer actually decided to kill something, and in that state, > one couldn't even reboot. This expanded the time window of "broken" > from tens of seconds, and perhaps a service restart, into tens of > minutes and a power button recovery. Every #$%^&* time. > > De > -------------- next part -------------- An HTML attachment was scrubbed... URL: From drb at msu.edu Sat Mar 1 03:09:49 2025 From: drb at msu.edu (Dennis Boone) Date: Fri, 28 Feb 2025 12:09:49 -0500 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: (Your message of Fri, 28 Feb 2025 16:12:19 +0000.) References: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> Message-ID: <20250228170949.97B694EADFF@yagi.h-net.msu.edu> > I've read several bits of guidance lately suggesting avoiding swap > due to the increasing prevalence of solid-state memories. The > assertion is that I/O heavy swapping, especially if you get into a > thrashing state, is liable to age current storage technologies much > more than it would have in the platter disk era. I was told a few years ago by a smart person I respect that one of the reasons Mac laptops were still so poorly endowed with RAM (iirc 8 GB max) was because Apple was shipping SSD in them, so the performance hit wasn't really noticeable. One hopes those were sturdy (DWPD) SSDs, but barring the marketing decisions (removing ports people want), Apple's hardware engineering is usually decent. De From tuhs at tuhs.org Sat Mar 1 03:17:08 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Fri, 28 Feb 2025 09:17:08 -0800 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> Message-ID: In Feb 2013 I built the raspbian kernel on the original Raspi (512MB). Took about 10 hours. Don't recall if any swap was used but likely. Not sure how different was the raspbian kernel at that time compared to the stock linux kernel (if there is such a thing). I never wanted to repeat the build though! In contrast the plan9 kernel build took a minute. > On Feb 28, 2025, at 8:42 AM, Tom Lyon wrote: > > The counterpoint to "just say no to swapping" is that sometimes you really, really, want a job to proceed even if at a glacial pace. > > Some years ago I had to do a Linux kernel build on a 1GB(maybe less?) MIPS system, and even though it had oodles of swap space, the OOM killer kept biting. > Turns out you have to tune "swappiness" in poorly documented ways to get Linux to actually use virtual memory. I think the build took about a week once it worked. > > On Fri, Feb 28, 2025 at 8:04 AM Dennis Boone > wrote: >> > I’m probably a lost soul on this issue, but swap space is just a way >> > to turn program bugs into performance problems. >> >> You're hardly the only one. Some years ago, running Linux web and >> database servers, I quit creating swap space. A runaway program would >> turn the system into an infinite game of shuffle-the-pages well before >> the OOM killer actually decided to kill something, and in that state, >> one couldn't even reboot. This expanded the time window of "broken" >> from tens of seconds, and perhaps a service restart, into tens of >> minutes and a power button recovery. Every #$%^&* time. >> >> De -------------- next part -------------- An HTML attachment was scrubbed... URL: From woods at robohack.ca Sat Mar 1 05:35:44 2025 From: woods at robohack.ca (Greg A. Woods) Date: Fri, 28 Feb 2025 11:35:44 -0800 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: <20250228170949.97B694EADFF@yagi.h-net.msu.edu> References: <842124E7-A189-4743-B5C2-2F4E9166B1AB@serissa.com> <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <20250228160449.2FDB04EAE12@yagi.h-net.msu.edu> <20250228170949.97B694EADFF@yagi.h-net.msu.edu> Message-ID: At Fri, 28 Feb 2025 12:09:49 -0500, Dennis Boone wrote: Subject: [TUHS] Re: VM over-commit (and the OOM killers) > > I was told a few years ago by a smart person I respect that one of the > reasons Mac laptops were still so poorly endowed with RAM (iirc 8 GB > max) was because Apple was shipping SSD in them, so the performance hit > wasn't really noticeable. One hopes those were sturdy (DWPD) SSDs, but > barring the marketing decisions (removing ports people want), Apple's > hardware engineering is usually decent. That's probably mostly true, but also misguided and ignorant of the actual behaviour of RAM-starved systems. I suspect it was policy based on memory prices at some time in the past, but seemingly hasn't been re-evaluated for modern workloads more recently. MacOS has long been OK at handling a little bit of paging, but seems to get to the state of thrashing far sooner than one would expect it to. I find Apple's default memory provisioning to be half as big as it should be for the real-world uses most people encounter, especially these days with the bloatware that is modern desktop software (especially browsers!) On my BSD systems I always allocate lots of swap space (I try for at least as much as I have RAM, if not more), but I get worried any time I see any of it used. As Warner said, it's fine for transient loads, but if a system starts paging during regular production loads then I have a big problem. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From johnl at taugh.com Sat Mar 1 06:18:36 2025 From: johnl at taugh.com (John Levine) Date: 28 Feb 2025 15:18:36 -0500 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: <47fd2ec1-5896-31a4-8dd5-b91a1c0f8354@baby-dragons.com> <202502272206.51RM6AUB026294@ultimate.com> <20250228170949.97B694EADFF@yagi.h-net.msu.edu> <842 Message-ID: <20250228201836.B76C3BE2FB6B@ary.qy> It appears that Greg A. Woods said: >-=-=-=-=-=- > >At Fri, 28 Feb 2025 12:09:49 -0500, Dennis Boone wrote: >Subject: [TUHS] Re: VM over-commit (and the OOM killers) >> >> I was told a few years ago by a smart person I respect that one of the >> reasons Mac laptops were still so poorly endowed with RAM (iirc 8 GB >> max) was because Apple was shipping SSD in them, ... >That's probably mostly true, but also misguided and ignorant of the >actual behaviour of RAM-starved systems. I suspect it was policy based >on memory prices at some time in the past, but seemingly hasn't been >re-evaluated for modern workloads more recently. Current Macbooks all start at 16GB, and the biggest Pro model can have 128GB and an 8TB SSD for the low low price of $7200. I agree it's memory prices to make a balanced system. For a lot of people who are not doing heavy computing 8GB is fine. If you're doing a lot of compiling or ray-tracing, you need more computer. R's, John From tuhs at tuhs.org Sat Mar 1 06:53:20 2025 From: tuhs at tuhs.org (Arrigo Triulzi via TUHS) Date: Fri, 28 Feb 2025 21:53:20 +0100 Subject: [TUHS] VM over-commit (and the OOM killers) In-Reply-To: References: Message-ID: <218EA880-DBF3-4FAA-B3AB-B73790207181@alchemistowl.org> On 28 Feb 2025, at 20:36, Greg A. Woods wrote: > That's probably mostly true, but also misguided and ignorant of the > actual behaviour of RAM-starved systems. I suspect it was policy based > on memory prices at some time in the past, but seemingly hasn't been > re-evaluated for modern workloads more recently. I strongly agree: as an old Unix hand hitting swap was something you wanted to avoid at all cost, except for certain cases where swap use could be replaced by accurate use of files to swap data in and out of RAM during programme execution because of the predictability of data accesses. It was still bad programming but not as lethal as thrashing. The current macOS is a disappointment in that respect: on a 64GB machine when running the Apple Virtualisation Framework for VMs (UTM and Parallels) I am surprised at how just a couple of 8GB VMs can send the VMM into a frenzy. My FreeBSD bhyve hosts with similar memory and VM load do not exhibit this behaviour at all, despite ZFS ARC. Arrigo From tuhs at tuhs.org Sat Mar 1 10:16:02 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Sat, 01 Mar 2025 00:16:02 +0000 Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: Given that anything that obeys the ABI and has assembler entries to the kernel can request services, it seems to me it would be possible to stand up a user-land without C being present. Have any UNIXen ever done this after the advent of C? - Matt G. From imp at bsdimp.com Sat Mar 1 11:02:35 2025 From: imp at bsdimp.com (Warner Losh) Date: Fri, 28 Feb 2025 18:02:35 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Fri, Feb 28, 2025, 5:16 PM segaloco via TUHS wrote: > Given that anything that obeys the ABI and has assembler entries to the > kernel > can request services, it seems to me it would be possible to stand up a > user-land without C being present. Have any UNIXen ever done this after > the > advent of C? > u-root is a tiny busybox like environment written entirely in go. It goes one step farther: it demand compiles each binary as it's needed. The compiler is small enough for it to be a space win and fast enough that the system performance is adequate for it's primary purpose: using Linux to boot Linux... Warner > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Sat Mar 1 11:15:14 2025 From: clemc at ccc.com (Clem Cole) Date: Fri, 28 Feb 2025 20:15:14 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: Michel Gien lead a Unix reimplementation in Pascal (French SOL project) in the late 1970s/early 1980. Later they did a second version in C++ which was called Chorus. You'll have to ask Michel but I don’t think Sol required a C subsystem but I believe it had one to allow code to be imported. I never knew what compiler his team used but I was once told it was the Amsterdam compilers. If that is true, you could claim there is the boot strap issue if you want to get into the purity of did it need C. But once running there would not have been any need for it to be required. Plus, there were many Pascal compilers in Pascal in those days, but most of the compiler suites that allowed multiple front end were in C (such as the Amsterdam kit and gcc). Sent from a handheld expect more typos than usual On Fri, Feb 28, 2025 at 7:16 PM segaloco via TUHS wrote: > Given that anything that obeys the ABI and has assembler entries to the > kernel > can request services, it seems to me it would be possible to stand up a > user-land without C being present. Have any UNIXen ever done this after > the > advent of C? > > - Matt G. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Sat Mar 1 12:10:29 2025 From: crossd at gmail.com (Dan Cross) Date: Fri, 28 Feb 2025 21:10:29 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Fri, Feb 28, 2025 at 7:26 PM segaloco via TUHS wrote: > Given that anything that obeys the ABI and has assembler entries to the kernel > can request services, it seems to me it would be possible to stand up a > user-land without C being present. Have any UNIXen ever done this after the > advent of C? Kind of. u-root is a userland in Go, but is more targeted towards embedded applications (Ron will correct me if I am mistaken there). uutils is a Rust reimplementation of most of the GNU coreutils tools, but not (I don't think) the C library. - Dan C. From imp at bsdimp.com Sat Mar 1 13:22:41 2025 From: imp at bsdimp.com (Warner Losh) Date: Fri, 28 Feb 2025 20:22:41 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Fri, Feb 28, 2025, 7:11 PM Dan Cross wrote: > On Fri, Feb 28, 2025 at 7:26 PM segaloco via TUHS wrote: > > Given that anything that obeys the ABI and has assembler entries to the > kernel > > can request services, it seems to me it would be possible to stand up a > > user-land without C being present. Have any UNIXen ever done this after > the > > advent of C? > > Kind of. u-root is a userland in Go, but is more targeted towards > embedded applications (Ron will correct me if I am mistaken there) Yea. It's reason to exist is for LinuxBoot scripting, though lots of other uses are possible. I know it from LinuxBoot... uutils is a Rust reimplementation of most of the GNU coreutils tools, > but not (I don't think) the C library. > Correct. Rust uses it's own library for system calls. This is a never ending series of problems for FreeBSD since the C API sometimes differs slightly with what's passed to the kernel and those seemingly trivial differences cause bugs. Also, new system calls can take a while (years) to appear. FreeBSD moved inodes to 64 bits 10 years ago and it was only in the last years that rust stopped using the old 32bit inode system calls. Warner - Dan C. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rminnich at gmail.com Sat Mar 1 14:01:11 2025 From: rminnich at gmail.com (ron minnich) Date: Fri, 28 Feb 2025 20:01:11 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: I started u-root in 2011 because I did not want to do a yocto or onei or buildroot ever again. u-root has found wide application in firmware. U-root runs on anything Go runs on. Andrea Barisini has even targeted Go to bare metal with Tamago, and u-root runs there too. Tinygo can compile much of it to its 200 bare metal targets as well. The late Andrey Mirtchovski, known to many of you, contributed a lot to the early going. I should not digress too much from TUHS here, so I'll leave it at that. On Fri, Feb 28, 2025 at 7:41 PM Warner Losh wrote: > > > On Fri, Feb 28, 2025, 7:11 PM Dan Cross wrote: > >> On Fri, Feb 28, 2025 at 7:26 PM segaloco via TUHS wrote: >> > Given that anything that obeys the ABI and has assembler entries to the >> kernel >> > can request services, it seems to me it would be possible to stand up a >> > user-land without C being present. Have any UNIXen ever done this >> after the >> > advent of C? >> >> Kind of. u-root is a userland in Go, but is more targeted towards >> embedded applications (Ron will correct me if I am mistaken there) > > > Yea. It's reason to exist is for LinuxBoot scripting, though lots of other > uses are possible. I know it from LinuxBoot... > > uutils is a Rust reimplementation of most of the GNU coreutils tools, >> but not (I don't think) the C library. >> > > Correct. Rust uses it's own library for system calls. This is a never > ending series of problems for FreeBSD since the C API sometimes differs > slightly with what's passed to the kernel and those seemingly trivial > differences cause bugs. Also, new system calls can take a while (years) to > appear. FreeBSD moved inodes to 64 bits 10 years ago and it was only in the > last years that rust stopped using the old 32bit inode system calls. > > Warner > > - Dan C. >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Sat Mar 1 22:11:47 2025 From: tuhs at tuhs.org (Jason Stevens via TUHS) Date: Sat, 1 Mar 2025 12:11:47 -0000 Subject: [TUHS] DCJ-11 processor with 20k FPGA Message-ID: I assume people have seen this? https://github.com/ryomuk/TangNanoDCJ11MEM/tree/main It's capable of running Unix v1 & some limited amount of v6 among other things. The FPGA in question the Tang Nano 20k is sub 30GBP delivered from AliExpress. Kind of neat to combine a real processor with a simple FPGA implementation of the hardware. From lyndon at orthanc.ca Sun Mar 2 07:51:29 2025 From: lyndon at orthanc.ca (Lyndon Nerenberg (VE7TFX/VE6BBM)) Date: Sat, 01 Mar 2025 13:51:29 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: segaloco via TUHS writes: > Given that anything that obeys the ABI and has assembler entries to the ker= > nel > can request services, it seems to me it would be possible to stand up a > user-land without C being present. Have any UNIXen ever done this after th= > e > advent of C? SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler and made it a $$$ add on. That move single-handedly made GCC the reference compiler moving forward. --lyndon From lm at mcvoy.com Sun Mar 2 08:01:11 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 1 Mar 2025 14:01:11 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <20250301220111.GB13234@mcvoy.com> On Sat, Mar 01, 2025 at 01:51:29PM -0800, Lyndon Nerenberg (VE7TFX/VE6BBM) wrote: > segaloco via TUHS writes: > > Given that anything that obeys the ABI and has assembler entries to the ker= > > nel > > can request services, it seems to me it would be possible to stand up a > > user-land without C being present. Have any UNIXen ever done this after th= > > e > > advent of C? > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > and made it a $$$ add on. That move single-handedly made GCC the > reference compiler moving forward. I think that OP was asking about a userland compiled with Pascal or Go or Rust, anything other than C. I don't think it was a question of commercial C or Gcc (I agree that it was a bonehead move on Sun's part). -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From crossd at gmail.com Sun Mar 2 08:07:48 2025 From: crossd at gmail.com (Dan Cross) Date: Sat, 1 Mar 2025 17:07:48 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) wrote: > segaloco via TUHS writes: > > Given that anything that obeys the ABI and has assembler entries to the ker= > > nel > > can request services, it seems to me it would be possible to stand up a > > user-land without C being present. Have any UNIXen ever done this after th= > > e > > advent of C? > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > and made it a $$$ add on. That move single-handedly made GCC the > reference compiler moving forward. I believe that was in the shift to Solaris, aka SunOS 5.x. As I recall, even the last versions of SunOS 4 came with a bundled compiler (though it was pre-ANSI, and probably PCC based). - Dan C. From clemc at ccc.com Sun Mar 2 09:28:58 2025 From: clemc at ccc.com (Clem Cole) Date: Sat, 1 Mar 2025 18:28:58 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: Larry, you can correct me here, but it was when Sun finally wrote their own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book style optimizer with its C compiler generated better code than the PCC ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a revenue source. Unlike Masscomp, where we said to our ex-DEC marketing types— "Charge for the Fortran if you want to, but C is part of the system." IIRC: Sun continued to bundled a simple C compiler so you build the kernel, but it was trying to make $s on the compiler suite. SW economics can be difficult. Application firms like CAD or tools firms, of course, make all their money on their SW. But systems companies make their money on the HW and need the compilers to generate the applications to build the ecosystem to sell the HW. Funny thing, I have always said huge reason BLISS lost was that DEC charged for $5000 per CPU for it on TOPS or VMS, while C was free with UNIX - even though the difference is the resulting code was remarkable. So many people stayed away because they did not want to spend the extra $s. ᐧ On Sat, Mar 1, 2025 at 5:08 PM Dan Cross wrote: > On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) > wrote: > > segaloco via TUHS writes: > > > Given that anything that obeys the ABI and has assembler entries to > the ker= > > > nel > > > can request services, it seems to me it would be possible to stand up a > > > user-land without C being present. Have any UNIXen ever done this > after th= > > > e > > > advent of C? > > > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > > and made it a $$$ add on. That move single-handedly made GCC the > > reference compiler moving forward. > > I believe that was in the shift to Solaris, aka SunOS 5.x. As I > recall, even the last versions of SunOS 4 came with a bundled compiler > (though it was pre-ANSI, and probably PCC based). > > - Dan C. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From imp at bsdimp.com Sun Mar 2 09:37:33 2025 From: imp at bsdimp.com (Warner Losh) Date: Sat, 1 Mar 2025 16:37:33 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 4:29 PM Clem Cole wrote: > Larry, you can correct me here, but it was when Sun finally wrote their > own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book > style optimizer with its C compiler generated better code than the PCC > ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a > revenue source. Unlike Masscomp, where we said to our ex-DEC marketing > types— "Charge for the Fortran if you want to, but C is part of the > system." > > IIRC: Sun continued to bundled a simple C compiler so you build the > kernel, but it was trying to make $s on the compiler suite. > Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. > SW economics can be difficult. Application firms like CAD or tools > firms, of course, make all their money on their SW. But systems > companies make their money on the HW and need the compilers to generate the > applications to build the ecosystem to sell the HW. Funny thing, I have > always said huge reason BLISS lost was that DEC charged for $5000 per CPU > for it on TOPS or VMS, while C was free with UNIX - even though the > difference is the resulting code was remarkable. So many people stayed > away because they did not want to spend the extra $s. > Yea, BLISS might have been better, but making the case it was $5k per CPU better was super hard. Warner On Sat, Mar 1, 2025 at 5:08 PM Dan Cross wrote: > >> On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) >> wrote: >> > segaloco via TUHS writes: >> > > Given that anything that obeys the ABI and has assembler entries to >> the ker= >> > > nel >> > > can request services, it seems to me it would be possible to stand up >> a >> > > user-land without C being present. Have any UNIXen ever done this >> after th= >> > > e >> > > advent of C? >> > >> > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler >> > and made it a $$$ add on. That move single-handedly made GCC the >> > reference compiler moving forward. >> >> I believe that was in the shift to Solaris, aka SunOS 5.x. As I >> recall, even the last versions of SunOS 4 came with a bundled compiler >> (though it was pre-ANSI, and probably PCC based). >> >> - Dan C. >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lm at mcvoy.com Sun Mar 2 09:38:05 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 1 Mar 2025 15:38:05 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <20250301233805.GC13234@mcvoy.com> I didn't have anything to do with the compiler group so I have no idea what they were thinking. So long as cc compiled the kernel without bugs, they did their own thing. I have no idea who thought it was a good idea to unbundle C, I was not a fan but wasn't important enough that my opinion held sway. Rob Gingell would likely know. On Sat, Mar 01, 2025 at 06:28:58PM -0500, Clem Cole wrote: > Larry, you can correct me here, but it was when Sun finally wrote their own > - learning from DEC [and Masscomp] that the real Bill Wulf' Green book > style optimizer with its C compiler generated better code than the PCC > ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a > revenue source. Unlike Masscomp, where we said to our ex-DEC marketing > types??? "Charge for the Fortran if you want to, but C is part of the > system." > > IIRC: Sun continued to bundled a simple C compiler so you build the kernel, > but it was trying to make $s on the compiler suite. > > SW economics can be difficult. Application firms like CAD or tools > firms, of course, make all their money on their SW. But systems > companies make their money on the HW and need the compilers to generate the > applications to build the ecosystem to sell the HW. Funny thing, I have > always said huge reason BLISS lost was that DEC charged for $5000 per CPU > for it on TOPS or VMS, while C was free with UNIX - even though the > difference is the resulting code was remarkable. So many people stayed > away because they did not want to spend the extra $s. > ??? > > On Sat, Mar 1, 2025 at 5:08???PM Dan Cross wrote: > > > On Sat, Mar 1, 2025 at 5:01???PM Lyndon Nerenberg (VE7TFX/VE6BBM) > > wrote: > > > segaloco via TUHS writes: > > > > Given that anything that obeys the ABI and has assembler entries to > > the ker= > > > > nel > > > > can request services, it seems to me it would be possible to stand up a > > > > user-land without C being present. Have any UNIXen ever done this > > after th= > > > > e > > > > advent of C? > > > > > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > > > and made it a $$$ add on. That move single-handedly made GCC the > > > reference compiler moving forward. > > > > I believe that was in the shift to Solaris, aka SunOS 5.x. As I > > recall, even the last versions of SunOS 4 came with a bundled compiler > > (though it was pre-ANSI, and probably PCC based). > > > > - Dan C. > > -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From tuhs at tuhs.org Sun Mar 2 09:46:59 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Sat, 01 Mar 2025 23:46:59 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Saturday, March 1st, 2025 at 3:37 PM, Warner Losh wrote: > > > On Sat, Mar 1, 2025 at 4:29 PM Clem Cole wrote: > > > Larry, you can correct me here, but it was when Sun finally wrote their own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book style optimizer with its C compiler generated better code than the PCC ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a revenue source. Unlike Masscomp, where we said to our ex-DEC marketing types— "Charge for the Fortran if you want to, but C is part of the system." > > > > IIRC: Sun continued to bundled a simple C compiler so you build the kernel, but it was trying to make $s on the compiler suite. > > > Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. > > > SW economics can be difficult. Application firms like CAD or tools firms, of course, make all their money on their SW. But systems companies make their money on the HW and need the compilers to generate the applications to build the ecosystem to sell the HW. Funny thing, I have always said huge reason BLISS lost was that DEC charged for $5000 per CPU for it on TOPS or VMS, while C was free with UNIX - even though the difference is the resulting code was remarkable. So many people stayed away because they did not want to spend the extra $s. > > > Yea, BLISS might have been better, but making the case it was $5k per CPU better was super hard. > > > Warner > I don't think like a business type, but to my mind cheap-to-free, accessible development tools are a no-brainer. More surface area for devs means more devs able to target your system means more likelihood of killer apps showing up on your platform. If it's locked up so tight nobody can sustainably write software, then the lifeblood of your OS goes down the tubes. Again though, not claiming to be an expert on the business side of things, restrictive dev tool access just echoes of the same gripes I have with hardware IP for which the interfaces/datasheets are under expensive and arcane NDAs. Like, okay, thanks for making the risk of evaluating your platform untenable for a large tract of the software community I guess...eval boards are already expensive enough. But I digress, I seem to recall reading in another thread here that AIX may have had a fair deal of IBM stuff like perhaps some PL/I or PL/S down in the guts of significant chunks at one point, but I couldn't speak to that with any authority. I could see IBM what with their legacy in languages bristling at letting C be the star of the show. - Matt G. From crossd at gmail.com Sun Mar 2 09:46:46 2025 From: crossd at gmail.com (Dan Cross) Date: Sat, 1 Mar 2025 18:46:46 -0500 Subject: [TUHS] Compilers (was Re: Any UNIX With No C In Userland?) In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 6:37 PM Warner Losh wrote: > On Sat, Mar 1, 2025 at 4:29 PM Clem Cole wrote: >> Larry, you can correct me here, but it was when Sun finally wrote their own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book style optimizer with its C compiler generated better code than the PCC ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a revenue source. Unlike Masscomp, where we said to our ex-DEC marketing types— "Charge for the Fortran if you want to, but C is part of the system." >> >> IIRC: Sun continued to bundled a simple C compiler so you build the kernel, but it was trying to make $s on the compiler suite. > > Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. It's been years but I remember bootstrapping gcc with the `cc` that came with SunOS (was it in /usr/ucb? For some reason that sticks in my memory). I don't recall additional workaround steps, though perhaps one had to install other GNU utilities? Gcc got you ANSI and some amount of compatibility with C++ (which hadn't yet been standardized). I can't remember if Objective-C was in gcc at the time yet, but I suspect it was via NeXT. But beyond raw language compatibility, I don't remember it being a very good compiler at the time. It didn't really start to become competitive until GCC 3 or so. >> SW economics can be difficult. Application firms like CAD or tools firms, of course, make all their money on their SW. But systems companies make their money on the HW and need the compilers to generate the applications to build the ecosystem to sell the HW. Funny thing, I have always said huge reason BLISS lost was that DEC charged for $5000 per CPU for it on TOPS or VMS, while C was free with UNIX - even though the difference is the resulting code was remarkable. So many people stayed away because they did not want to spend the extra $s. > > Yea, BLISS might have been better, but making the case it was $5k per CPU better was super hard. People doing HPC locally used to pay for better commercial compilers; GCC and Clang have definitely caught up now, but I'm sure they were only concerned with Fortran at the time. The Portland Group compiler was pretty darned good in the mid-90s when we were starting to look at cheap x86 boxes because of the price/performance ratio: even though the big RISC machines were still better in terms of absolute numbers, they were clearly losing the battle on the cost ratio. - Dan C. >> On Sat, Mar 1, 2025 at 5:08 PM Dan Cross wrote: >>> >>> On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) >>> wrote: >>> > segaloco via TUHS writes: >>> > > Given that anything that obeys the ABI and has assembler entries to the ker= >>> > > nel >>> > > can request services, it seems to me it would be possible to stand up a >>> > > user-land without C being present. Have any UNIXen ever done this after th= >>> > > e >>> > > advent of C? >>> > >>> > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler >>> > and made it a $$$ add on. That move single-handedly made GCC the >>> > reference compiler moving forward. >>> >>> I believe that was in the shift to Solaris, aka SunOS 5.x. As I >>> recall, even the last versions of SunOS 4 came with a bundled compiler >>> (though it was pre-ANSI, and probably PCC based). >>> >>> - Dan C. From imp at bsdimp.com Sun Mar 2 09:57:39 2025 From: imp at bsdimp.com (Warner Losh) Date: Sat, 1 Mar 2025 16:57:39 -0700 Subject: [TUHS] Compilers (was Re: Any UNIX With No C In Userland?) In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 4:47 PM Dan Cross wrote: > On Sat, Mar 1, 2025 at 6:37 PM Warner Losh wrote: > > On Sat, Mar 1, 2025 at 4:29 PM Clem Cole wrote: > >> Larry, you can correct me here, but it was when Sun finally wrote their > own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book > style optimizer with its C compiler generated better code than the PCC > ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a > revenue source. Unlike Masscomp, where we said to our ex-DEC marketing > types— "Charge for the Fortran if you want to, but C is part of the system." > >> > >> IIRC: Sun continued to bundled a simple C compiler so you build the > kernel, but it was trying to make $s on the compiler suite. > > > > Yes. I think so. But I also think that said C compiler wasn't adequate > to bootstrap gcc or that there were extra steps / workarounds needed to do > that. This was during the K&R -> ANSI cutover that Sun did this as well, > and the old compiler was definitely K&R only. > > It's been years but I remember bootstrapping gcc with the `cc` that > came with SunOS (was it in /usr/ucb? For some reason that sticks in my > memory). I don't recall additional workaround steps, though perhaps > one had to install other GNU utilities? > That may be what I'm recalling... you needed gmake, binutils, bison, flex and maybe a few others for sure... I also recall that 1.x was a lot harder to get going than 2.x on sparc. > Gcc got you ANSI and some amount of compatibility with C++ (which > hadn't yet been standardized). I can't remember if Objective-C was in > gcc at the time yet, but I suspect it was via NeXT. But beyond raw > language compatibility, I don't remember it being a very good compiler > at the time. It didn't really start to become competitive until GCC 3 > or so. > Yea, even gcc2 was a huge win. > >> SW economics can be difficult. Application firms like CAD or tools > firms, of course, make all their money on their SW. But systems companies > make their money on the HW and need the compilers to generate the > applications to build the ecosystem to sell the HW. Funny thing, I have > always said huge reason BLISS lost was that DEC charged for $5000 per CPU > for it on TOPS or VMS, while C was free with UNIX - even though the > difference is the resulting code was remarkable. So many people stayed > away because they did not want to spend the extra $s. > > > > Yea, BLISS might have been better, but making the case it was $5k per > CPU better was super hard. > > People doing HPC locally used to pay for better commercial compilers; > GCC and Clang have definitely caught up now, but I'm sure they were > only concerned with Fortran at the time. The Portland Group compiler > was pretty darned good in the mid-90s when we were starting to look at > cheap x86 boxes because of the price/performance ratio: even though > the big RISC machines were still better in terms of absolute numbers, > they were clearly losing the battle on the cost ratio. > Yea. There comes a point where large swarms of adequate at a reasonable price is a better call than a few, more expensive things that get the job done in a smaller footprint of floor space or time (but higher cost). For a lot of things that crossover took place in the 2000-2010 time frame. Warner > - Dan C. > > >> On Sat, Mar 1, 2025 at 5:08 PM Dan Cross wrote: > >>> > >>> On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) > >>> wrote: > >>> > segaloco via TUHS writes: > >>> > > Given that anything that obeys the ABI and has assembler entries > to the ker= > >>> > > nel > >>> > > can request services, it seems to me it would be possible to stand > up a > >>> > > user-land without C being present. Have any UNIXen ever done this > after th= > >>> > > e > >>> > > advent of C? > >>> > > >>> > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > >>> > and made it a $$$ add on. That move single-handedly made GCC the > >>> > reference compiler moving forward. > >>> > >>> I believe that was in the shift to Solaris, aka SunOS 5.x. As I > >>> recall, even the last versions of SunOS 4 came with a bundled compiler > >>> (though it was pre-ANSI, and probably PCC based). > >>> > >>> - Dan C. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sauer at technologists.com Sun Mar 2 10:10:58 2025 From: sauer at technologists.com (Charles H Sauer (he/him)) Date: Sat, 1 Mar 2025 18:10:58 -0600 Subject: [TUHS] AIX (RT/PC & RS/6000) 1/2/3 compilers [was Re: Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On 3/1/2025 5:46 PM, segaloco via TUHS wrote: > > But I digress, I seem to recall reading in another thread here that AIX may have had a fair deal of IBM stuff like perhaps some PL/I or PL/S down in the guts of significant chunks at one point, but I couldn't speak to that with any authority. I could see IBM what with their legacy in languages bristling at letting C be the star of the show. > > - Matt G. For all our faults in the AIX team and IBM in general, there was no desire to have legacy IBM languages as primary in AIX. https://notes.technologists.com/notes/2017/03/08/lets-start-at-the-very-beginning-801-romp-rtpc-aix-versions/ tries to explain why PL.8 code existed in early AIX and how PL.8 code was eliminated in AIX 3 development. AIX for the RT 1&2 bundled pcc (with the HCR optimization phase in AIX 2). Since I left IBM in the midst of AIX 3 development, I'm not certain what happened with compilers after I left. Part of the confusion was IBM Toronto rewriting the Yorktown Research compiler to be a "product worthy" C compiler. There may have been desire to gain revenue for the Toronto compiler, but I assume that some some C compiler was bundled in AIX 3. Charlie -- voice: +1.512.784.7526 e-mail: sauer at technologists.com fax: +1.512.346.5240 Web: https://technologists.com/sauer/ Facebook/Google/LinkedIn/mas.to: CharlesHSauer From peter.martin.yardley at gmail.com Sun Mar 2 10:46:27 2025 From: peter.martin.yardley at gmail.com (Peter Yardley) Date: Sun, 2 Mar 2025 11:46:27 +1100 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <77926C79-1344-415A-9D23-BEC53D84CD1E@gmail.com> I remember building gcc with the bundled SUN C compiler. Cant’t remember what flags etc were needed. We used to build it three times, to be sure, to be sure, to be sure. Once to get a working compiler, again to remove taint and once more to get a build made with a taint free compiler. > On 2 Mar 2025, at 10:37 am, Warner Losh wrote: > > > > On Sat, Mar 1, 2025 at 4:29 PM Clem Cole wrote: > Larry, you can correct me here, but it was when Sun finally wrote their own - learning from DEC [and Masscomp] that the real Bill Wulf' Green book style optimizer with its C compiler generated better code than the PCC ones. Unfortunately, Sun's marketing (also ex-DEC) decided it could be a revenue source. Unlike Masscomp, where we said to our ex-DEC marketing types— "Charge for the Fortran if you want to, but C is part of the system." > > IIRC: Sun continued to bundled a simple C compiler so you build the kernel, but it was trying to make $s on the compiler suite. > > Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. > SW economics can be difficult. Application firms like CAD or tools firms, of course, make all their money on their SW. But systems companies make their money on the HW and need the compilers to generate the applications to build the ecosystem to sell the HW. Funny thing, I have always said huge reason BLISS lost was that DEC charged for $5000 per CPU for it on TOPS or VMS, while C was free with UNIX - even though the difference is the resulting code was remarkable. So many people stayed away because they did not want to spend the extra $s. > > Yea, BLISS might have been better, but making the case it was $5k per CPU better was super hard. > > Warner > > On Sat, Mar 1, 2025 at 5:08 PM Dan Cross wrote: > On Sat, Mar 1, 2025 at 5:01 PM Lyndon Nerenberg (VE7TFX/VE6BBM) > wrote: > > segaloco via TUHS writes: > > > Given that anything that obeys the ABI and has assembler entries to the ker= > > > nel > > > can request services, it seems to me it would be possible to stand up a > > > user-land without C being present. Have any UNIXen ever done this after th= > > > e > > > advent of C? > > > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > > and made it a $$$ add on. That move single-handedly made GCC the > > reference compiler moving forward. > > I believe that was in the shift to Solaris, aka SunOS 5.x. As I > recall, even the last versions of SunOS 4 came with a bundled compiler > (though it was pre-ANSI, and probably PCC based). > > - Dan C. .1.3.6.1.4.1.8852.4.2 Peter Yardley peter.martin.yardley at gmail.com From lm at mcvoy.com Sun Mar 2 11:17:40 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 1 Mar 2025 17:17:40 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250301220111.GB13234@mcvoy.com> References: <20250301220111.GB13234@mcvoy.com> Message-ID: <20250302011740.GE31995@mcvoy.com> On Sat, Mar 01, 2025 at 02:01:11PM -0800, Larry McVoy wrote: > On Sat, Mar 01, 2025 at 01:51:29PM -0800, Lyndon Nerenberg (VE7TFX/VE6BBM) wrote: > > segaloco via TUHS writes: > > > Given that anything that obeys the ABI and has assembler entries to the ker= > > > nel > > > can request services, it seems to me it would be possible to stand up a > > > user-land without C being present. Have any UNIXen ever done this after th= > > > e > > > advent of C? > > > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > > and made it a $$$ add on. That move single-handedly made GCC the > > reference compiler moving forward. > > I think that OP was asking about a userland compiled with Pascal or Go > or Rust, anything other than C. > > I don't think it was a question of commercial C or Gcc (I agree that it > was a bonehead move on Sun's part). The funny thing was that somewhere around then Sun Labs was paying Micheal Tiemann to make g++ work. With a deal that let him retain the rights to the code. I never understood that, one hand wants to charge for cc and the other hand is paying for free g++? The ways of Sun could be strange. From tuhs at tuhs.org Sun Mar 2 11:33:18 2025 From: tuhs at tuhs.org (Grant Taylor via TUHS) Date: Sat, 1 Mar 2025 19:33:18 -0600 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <77926C79-1344-415A-9D23-BEC53D84CD1E@gmail.com> References: <77926C79-1344-415A-9D23-BEC53D84CD1E@gmail.com> Message-ID: <2a342b4f-b585-4d1f-9a3c-68ff317406a8@spamtrap.tnetconsulting.net> On 3/1/25 6:46 PM, Peter Yardley wrote: > We used to build it three times, to be sure, to be sure, to be > sure. Once to get a working compiler, again to remove taint and once > more to get a build made with a taint free compiler. I've seen hints of this in Gentoo Stage 1 installs. Use existing tools to build Gentoo versions of the tools. Then use the Gentoo version of the tools to re-build the Gentoo version of the tools. Then do an empty tree rebuild of everything using the Gentoo version of the tools. -- Grant. . . . From tuhs at tuhs.org Sun Mar 2 11:58:15 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sat, 1 Mar 2025 17:58:15 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> On Feb 28, 2025, at 4:16 PM, segaloco via TUHS wrote: > > Given that anything that obeys the ABI and has assembler entries to the kernel > can request services, it seems to me it would be possible to stand up a > user-land without C being present. Have any UNIXen ever done this after the > advent of C? vinix is written in the V language and can have userland written in V as well but so far the compiler compiles to c (so requires a c compiler) so not sure it qualifies + its development seems to have stalled somewhat. IIRC there is at least one v6 level (toy?) OS written in rust with a rust only userland. May be for anything non-toy you'd have to provide a C compatible API. Personally I don't see much point in reimplementing a 50+ year old OS in a new language; at least do something new and innovative (with unix emulation for dusty decks) but ¯\_(ツ)_/¯ From g.branden.robinson at gmail.com Sun Mar 2 12:07:37 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Sat, 1 Mar 2025 20:07:37 -0600 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302011740.GE31995@mcvoy.com> References: <20250301220111.GB13234@mcvoy.com> <20250302011740.GE31995@mcvoy.com> Message-ID: <20250302020737.aw6ef53l6h63pmew@illithid> At 2025-03-01T17:17:40-0800, Larry McVoy wrote: > The funny thing was that somewhere around then Sun Labs was paying > Micheal Tiemann to make g++ work. With a deal that let him retain the > rights to the code. What would the alternative have been? If the goal was "to make g++ work", then the code would, at the time, have had to be contributed to the FSF anyway. My understanding is that the FSF's copyright assignment agreements, while scandalous in some circles at the time (and remain so in a few), were far more generous than today's contributor license agreements. So Tiemann would have retained rights to what he wrote, but neither he nor Sun could have used it to stand up a competing proprietary C++ compiler without reimplementing the rest of said compiler, because of [dramatic orchestral sting] the GPL. > I never understood that, one hand wants to charge for cc and the other hand is paying for free g++? Did Sun's proprietary C compiler also handle C++ at that time? If not, then having a C++ story on SunOS and/or Solaris could easily have been more important than fear of lost revenue to a C++ compiler they couldn't offer an alternative to anyway. If it did, it _still_ might have made sense, because while I was only barely conscious of the C++ marketplace at the time, with the language's standardization nigh (for definitions of "nigh" stretching out many more years than anyone planned), Sun's compiler guys might have feared giving up mindshare and influence on the future ISO C++ to Borland or Microsoft. The foregoing speculative concern is consistent with the speed of Sun's pivot to Oak/Java around the time Microsoft showed that (1) Windows NT was going to block further penetration of Unix into the server OS sector and substantially erode Unix's place in workstation and high-end PC configurations _and_ (2) it was going to destroy Borland C++ with Visual C++ by hook or by crook (the sort of stuff that got Bill Gates haled into a CSPAN-televised Congressional hearing a few years later, but with respect to Web browsers). > The ways of Sun could be strange. Until you get to a level where decision makers truly have no accountability for anything they do (don't strain yourself looking for examples in the news), I've found (from my minuscule sampling of industry) that business decisions can usually be explained rationally. The problem is that the rationales are often kept secret, sometimes in the name of "strategy" but more often, I think, to keep line staff in their place instead of critically considering management decisions-- unless they're willing to join political alliances and thereby make themselves (even more) vulnerable to dismissal for unspoken reasons. That said, I'm sure it would surprise no one if the "real reason" turned out to be a stupid one. Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From g.branden.robinson at gmail.com Sun Mar 2 12:14:27 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Sat, 1 Mar 2025 20:14:27 -0600 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302020737.aw6ef53l6h63pmew@illithid> References: <20250301220111.GB13234@mcvoy.com> <20250302011740.GE31995@mcvoy.com> <20250302020737.aw6ef53l6h63pmew@illithid> Message-ID: <20250302021427.ekjdiyc7eu7446of@illithid> At 2025-03-01T20:07:42-0600, G. Branden Robinson wrote: > If it did, it _still_ might have made sense, because while I was only > barely conscious of the C++ marketplace at the time, with the language's > standardization nigh (for definitions of "nigh" stretching out many more > years than anyone planned), Sun's compiler guys might have feared giving > up mindshare and influence on the future ISO C++ to Borland or > Microsoft. I forgot to mention some support for this argument. I quote Bjarne Stroustrup in the 4th edition of his central C++ book. "Clearly, the 1998 language was far superior in features and in particular the detail of specification to the 1989 language. However, not all changes were improvements. In addition to the inevitable minor mistakes, two major features were added that should not have been: * Exception specifications provide run-time enforcement of which exceptions a function is allowed to throw. They were added at the energetic initiative of people from Sun Microsystems. Exception specifications turned out to be worse than useless for improving readability, reliability, and performance. They are deprecated (scheduled for removal) in the 2011 standard. ..." (p. 26) Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From lm at mcvoy.com Sun Mar 2 12:16:57 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 1 Mar 2025 18:16:57 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> Message-ID: <20250302021657.GD13234@mcvoy.com> On Sat, Mar 01, 2025 at 05:58:15PM -0800, Bakul Shah via TUHS wrote: > On Feb 28, 2025, at 4:16???PM, segaloco via TUHS wrote: > > > > Given that anything that obeys the ABI and has assembler entries to the kernel > > can request services, it seems to me it would be possible to stand up a > > user-land without C being present. Have any UNIXen ever done this after the > > advent of C? > > vinix is written in the V language and can have userland written in V as > well but so far the compiler compiles to c (so requires a c compiler) so > not sure it qualifies + its development seems to have stalled somewhat. > > IIRC there is at least one v6 level (toy?) OS written in rust with a > rust only userland. May be for anything non-toy you'd have to provide > a C compatible API. > > Personally I don't see much point in reimplementing a 50+ year old OS in > a new language; at least do something new and innovative (with unix > emulation for dusty decks) but ??\_(???)_/?? I took a look at the V language, not a fan. But I'm a grumpy old C programmer and I tend to hate languages that change the syntax for no good reason. I would much prefer to see C evolved. Not like C++ but just better C. It dismays me that people feel like they need to make a mark with some new "better" syntax. C syntax is fine, it's well understood by a zillion programmers. So why not evolve that syntax to a better C? We did that with https://www.little-lang.org/index.html which is a compiled C like language. There are a ton of extensions to C that just made sense to me. It never caught on but it still stands as an example of how you could take C syntax and extend it to be more useful. Why people don't do that is a mystery to me. -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From crossd at gmail.com Sun Mar 2 12:43:56 2025 From: crossd at gmail.com (Dan Cross) Date: Sat, 1 Mar 2025 21:43:56 -0500 Subject: [TUHS] AIX (RT/PC & RS/6000) 1/2/3 compilers [was Re: Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 7:21 PM Charles H Sauer (he/him) wrote: > On 3/1/2025 5:46 PM, segaloco via TUHS wrote: > > But I digress, I seem to recall reading in another thread here that AIX may have had a fair deal of IBM stuff like perhaps some PL/I or PL/S down in the guts of significant chunks at one point, but I couldn't speak to that with any authority. I could see IBM what with their legacy in languages bristling at letting C be the star of the show. > > For all our faults in the AIX team and IBM in general, there was no > desire to have legacy IBM languages as primary in AIX. > > https://notes.technologists.com/notes/2017/03/08/lets-start-at-the-very-beginning-801-romp-rtpc-aix-versions/ > tries to explain why PL.8 code existed in early AIX and how PL.8 code > was eliminated in AIX 3 development. > > AIX for the RT 1&2 bundled pcc (with the HCR optimization phase in AIX 2). > > Since I left IBM in the midst of AIX 3 development, I'm not certain what > happened with compilers after I left. Part of the confusion was IBM > Toronto rewriting the Yorktown Research compiler to be a "product > worthy" C compiler. There may have been desire to gain revenue for the > Toronto compiler, but I assume that some some C compiler was bundled in > AIX 3. As I recall, you needed a license for the compiler suite on AIX 3; the compiler was XL C, and was very highly regarded (other compilers in the same general family were XL C++ and XL Fortran). By AIX 4 this was definitely true. - Dan C. From tuhs at tuhs.org Sun Mar 2 13:00:03 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sat, 1 Mar 2025 19:00:03 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302021657.GD13234@mcvoy.com> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> Message-ID: <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> On Mar 1, 2025, at 6:16 PM, Larry McVoy wrote: > > I took a look at the V language, not a fan. But I'm a grumpy old C > programmer and I tend to hate languages that change the syntax for > no good reason. There are quite a few newer languages but none outstanding (IMHO). I do like V but that may be because I haven't written enough code in it! > I would much prefer to see C evolved. Not like C++ but just better C. > It dismays me that people feel like they need to make a mark with some > new "better" syntax. C syntax is fine, it's well understood by a zillion > programmers. So why not evolve that syntax to a better C? AFAIK all such attempts have failed. C is a great portable assembler but not so great as a high level language. > We did that with https://www.little-lang.org/index.html which is a compiled > C like language. There are a ton of extensions to C that just made sense > to me. It never caught on but it still stands as an example of how you > could take C syntax and extend it to be more useful. Why people don't > do that is a mystery to me. I looked at it when it was mentioned in the past. Didn't like it enough. Anyway, my main point was that please do something innovative and interesting if you must use a new HLL for an OS! From lm at mcvoy.com Sun Mar 2 13:16:47 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 1 Mar 2025 19:16:47 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> Message-ID: <20250302031646.GF13234@mcvoy.com> On Sat, Mar 01, 2025 at 07:00:03PM -0800, Bakul Shah wrote: > On Mar 1, 2025, at 6:16???PM, Larry McVoy wrote: > > We did that with https://www.little-lang.org/index.html which is a compiled > > C like language. There are a ton of extensions to C that just made sense > > to me. It never caught on but it still stands as an example of how you > > could take C syntax and extend it to be more useful. Why people don't > > do that is a mystery to me. > > I looked at it when it was mentioned in the past. Didn't like it enough. Yeah, it has no traction, I don't blame you in the slightest. > Anyway, my main point was that please do something innovative and > interesting if you must use a new HLL for an OS! So little-lang wasn't for OS stuff, it was for userland, but I feel it was plenty innovative. It fit into C syntax but added a bunch of useful stuff. string type that managed the memory in the language, no more malloc/free crap. Case statements that could take variables, regexp as switches. Perl like I/O with regex built in. It was just a pile of pleasant enhancements to C. And it could be an OS language, I don't see why not. If I had enough money, I'd fund a gcc --lang=Little syntax and the world would be a better place. In. My. Opinion. Not everyone elses but I really believe if gcc had that dialect a lot of C people would move to it. I'm not pushing my pet language, I'm holding it up as an example of how you could make a new programming language. Take what works, extend it with what works. Preserve people's knowledge rather than force them to learn a new syntax that does the same thing. Extend rather than replace. From imp at bsdimp.com Sun Mar 2 14:31:43 2025 From: imp at bsdimp.com (Warner Losh) Date: Sat, 1 Mar 2025 21:31:43 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302020737.aw6ef53l6h63pmew@illithid> References: <20250301220111.GB13234@mcvoy.com> <20250302011740.GE31995@mcvoy.com> <20250302020737.aw6ef53l6h63pmew@illithid> Message-ID: On Sat, Mar 1, 2025, 7:07 PM G. Branden Robinson < g.branden.robinson at gmail.com> wrote: > Did Sun's proprietary C compiler also handle C++ at that time? > Yes. They had a cfront interim release that we tested, but their SUNWpro compiler later replaced it with something better, but I don't recall the exact details. I do recall SUNWpro was pickier than the older pcc + cfront one that we used before. If it did, it _still_ might have made sense, because while I was only > barely conscious of the C++ marketplace at the time, with the language's > standardization nigh (for definitions of "nigh" stretching out many more > years than anyone planned), Sun's compiler guys might have feared giving > up mindshare and influence on the future ISO C++ to Borland or > Microsoft. > SUNWpro compilers pre-date any standardization efforts... The ARM was in this time, and C++ was struggling to finalize enough things to make it worthwhile to get it to ISO/ANSI. > The foregoing speculative concern is consistent with the speed of Sun's > pivot to Oak/Java around the time Microsoft showed that (1) Windows NT > was going to block further penetration of Unix into the server OS sector > and substantially erode Unix's place in workstation and high-end PC > configurations _and_ (2) it was going to destroy Borland C++ with Visual > C++ by hook or by crook (the sort of stuff that got Bill Gates haled > into a CSPAN-televised Congressional hearing a few years later, but with > respect to Web browsers). > JAVA post-dates this pivot by a few years... That said, I'm sure it would surprise no one if the "real reason" turned > out to be a stupid one. > Warner -------------- next part -------------- An HTML attachment was scrubbed... URL: From lm at mcvoy.com Mon Mar 3 01:46:23 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sun, 2 Mar 2025 07:46:23 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302020737.aw6ef53l6h63pmew@illithid> References: <20250301220111.GB13234@mcvoy.com> <20250302011740.GE31995@mcvoy.com> <20250302020737.aw6ef53l6h63pmew@illithid> Message-ID: <20250302154623.GG13234@mcvoy.com> On Sat, Mar 01, 2025 at 08:07:37PM -0600, G. Branden Robinson wrote: > Until you get to a level where decision makers truly have no > accountability for anything they do (don't strain yourself looking for > examples in the news), I've found (from my minuscule sampling of > industry) that business decisions can usually be explained rationally. > The problem is that the rationales are often kept secret, sometimes in > the name of "strategy" but more often, I think, to keep line staff in > their place instead of critically considering management decisions-- > unless they're willing to join political alliances and thereby make > themselves (even more) vulnerable to dismissal for unspoken reasons. As an engineer who was paid to try and convince Sun's execs that the pivot to Solaris/System V was a mistake, I have a different take. The 6 months I spent at the top of PAL-1 where McNealy & Co worked were unsuccessful but very educational. Hanging out there gave me insight into what all those execs did. It's very different from what an engineer does, we figure stuff out to the Nth degree. The execs did not have that luxury. What they did all day, every day, is make decisions about what to do but with only about 1/10th the amount of information an engineer would have. They simply couldn't wait long enough to know everything they wanted to know, the rest of the industry would pass them by. I would HATE to have their job. As I grew to understand their jobs, my respect for them went up but there is no amount of money that would make me want that job. From stuff at riddermarkfarm.ca Mon Mar 3 02:35:21 2025 From: stuff at riddermarkfarm.ca (Stuff Received) Date: Sun, 2 Mar 2025 11:35:21 -0500 Subject: [TUHS] AIX (RT/PC & RS/6000) 1/2/3 compilers [was Re: Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <31abe80b-6ecc-26a9-698b-1513224b8a17@riddermarkfarm.ca> On 2025-03-01 21:43, Dan Cross wrote: > On Sat, Mar 1, 2025 at 7:21 PM Charles H Sauer (he/him) > wrote: >> On 3/1/2025 5:46 PM, segaloco via TUHS wrote: >>> But I digress, I seem to recall reading in another thread here that AIX may have had a fair deal of IBM stuff like perhaps some PL/I or PL/S down in the guts of significant chunks at one point, but I couldn't speak to that with any authority. I could see IBM what with their legacy in languages bristling at letting C be the star of the show. >> >> For all our faults in the AIX team and IBM in general, there was no >> desire to have legacy IBM languages as primary in AIX. >> >> https://notes.technologists.com/notes/2017/03/08/lets-start-at-the-very-beginning-801-romp-rtpc-aix-versions/ >> tries to explain why PL.8 code existed in early AIX and how PL.8 code >> was eliminated in AIX 3 development. >> >> AIX for the RT 1&2 bundled pcc (with the HCR optimization phase in AIX 2). >> >> Since I left IBM in the midst of AIX 3 development, I'm not certain what >> happened with compilers after I left. Part of the confusion was IBM >> Toronto rewriting the Yorktown Research compiler to be a "product >> worthy" C compiler. There may have been desire to gain revenue for the >> Toronto compiler, but I assume that some some C compiler was bundled in >> AIX 3. > > As I recall, you needed a license for the compiler suite on AIX 3; the > compiler was XL C, and was very highly regarded (other compilers in > the same general family were XL C++ and XL Fortran). By AIX 4 this was > definitely true. > > - Dan C. Some decades ago, I was checking whether our product ran properly on AIX (version forgotten). The xlc man page stated that at the highest optimisation level, the "semantics of the source code might not be respected" (or something very similar). I no longer remember the versions. Has anyone else seen this? S. From merlyn at geeks.org Mon Mar 3 02:45:01 2025 From: merlyn at geeks.org (Doug McIntyre) Date: Sun, 2 Mar 2025 10:45:01 -0600 Subject: [TUHS] Compilers (was Re: Any UNIX With No C In Userland?) In-Reply-To: References: Message-ID: On Sat, Mar 01, 2025 at 06:46:46PM -0500, Dan Cross wrote: > On Sat, Mar 1, 2025 at 6:37 PM Warner Losh wrote: >> Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. > > It's been years but I remember bootstrapping gcc with the `cc` that > came with SunOS (was it in /usr/ucb? For some reason that sticks in my > memory). I don't recall additional workaround steps, though perhaps > one had to install other GNU utilities? In that time period, the gcc distribution had a special bootstrap version (older version?) of itself that was capable specificly of being able to be compiled by the cc that SunOS shipped with that was intended to compile the kernel bits shipped to configure in new limits/drivers, etc. The first stages of getting gcc compiled on SunOS was compiling that bootstrap version of itself, and then using that bootstraped special version of gcc to then compile the regular/up-to-date version of gcc. Eventually after some gcc major version, it stopped supporting/bundling this special bootstrap version, and dropped support for bootstrapping itself up on SunOS. From tuhs at tuhs.org Mon Mar 3 03:29:31 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sun, 2 Mar 2025 09:29:31 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302031646.GF13234@mcvoy.com> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> Message-ID: <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> On Mar 1, 2025, at 7:16 PM, Larry McVoy wrote: > >> >> Anyway, my main point was that please do something innovative and >> interesting if you must use a new HLL for an OS! > > So little-lang wasn't for OS stuff, it was for userland, but I feel it was > plenty innovative. It fit into C syntax but added a bunch of useful stuff. > string type that managed the memory in the language, no more malloc/free > crap. Case statements that could take variables, regexp as switches. > Perl like I/O with regex built in. It was just a pile of pleasant > enhancements to C. My comment was about innovation in the OS, not about any HLL. > And it could be an OS language, I don't see why not. Sure. > If I had enough money, I'd fund a gcc --lang=Little syntax and the world > would be a better place. In. My. Opinion. Not everyone elses but I > really believe if gcc had that dialect a lot of C people would move to it. > > I'm not pushing my pet language, I'm holding it up as an example of how > you could make a new programming language. Take what works, extend it > with what works. Preserve people's knowledge rather than force them to > learn a new syntax that does the same thing. Extend rather than replace. C's Achilles' heel is its type system. No extension can paper over that. Once you fix that it is no longer C. Newer languages can in fact be seen as fixing/adding things like proper strings, default read/write rules, memory allocation rules, modules/packages, object/closures, etc. over C. This is why I said they are not particularly outstanding. A lot of it seems rather adhoc. From tuhs at tuhs.org Mon Mar 3 03:54:01 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Sun, 02 Mar 2025 17:54:01 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> Message-ID: On Sunday, March 2nd, 2025 at 9:29 AM, Bakul Shah via TUHS wrote: > On Mar 1, 2025, at 7:16 PM, Larry McVoy lm at mcvoy.com wrote: > > > > Anyway, my main point was that please do something innovative and > > > interesting if you must use a new HLL for an OS! > > > > So little-lang wasn't for OS stuff, it was for userland, but I feel it was > > plenty innovative. It fit into C syntax but added a bunch of useful stuff. > > string type that managed the memory in the language, no more malloc/free > > crap. Case statements that could take variables, regexp as switches. > > Perl like I/O with regex built in. It was just a pile of pleasant > > enhancements to C. > > > My comment was about innovation in the OS, not about any HLL. > > > And it could be an OS language, I don't see why not. > > > Sure. > > > If I had enough money, I'd fund a gcc --lang=Little syntax and the world > > would be a better place. In. My. Opinion. Not everyone elses but I > > really believe if gcc had that dialect a lot of C people would move to it. > > > > I'm not pushing my pet language, I'm holding it up as an example of how > > you could make a new programming language. Take what works, extend it > > with what works. Preserve people's knowledge rather than force them to > > learn a new syntax that does the same thing. Extend rather than replace. > > > C's Achilles' heel is its type system. No extension can paper over > that. Once you fix that it is no longer C. Newer languages can in > fact be seen as fixing/adding things like proper strings, default > read/write rules, memory allocation rules, modules/packages, > object/closures, etc. over C. This is why I said they are not > particularly outstanding. A lot of it seems rather adhoc. For me though that speaks to the fact that C leaves a lot to the imagination. You've got the building blocks of a number of higher level concepts, but implementation thereof is left to the user. Of course you have these various extensions pop up, but the base language does not define the "one true way" to do it, rather, each case can be engineered with context in mind. I personally wouldn't see a one size fits all "upgrade" of the language gaining steam in this regard, partially because it wouldn't need to. C is great at what it is for, keeping it simple ensures a solid foundation on which more complex systems can be built without painting any given concern into a corner. But you know what they say, with great power comes great responsibility. If folks don't need that power or don't want that responsibility, there are a glut of other languages, both general and focused, for whatever need they may have that C cannot meet. To spin another quote ask not what your language of choice can do for you, ask what you can do for your language of choice. In other words, the growth of new features in a language doesn't always need to be an addition to the specification, it instead can be novel use that is then shared with others to the benefit of all that language's adherents. - Matt G. From g.branden.robinson at gmail.com Mon Mar 3 04:39:05 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Sun, 2 Mar 2025 12:39:05 -0600 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> Message-ID: <20250302183905.ibxuwqyvj4pnrihy@illithid> At 2025-03-02T17:54:01+0000, segaloco via TUHS wrote: > On Sunday, March 2nd, 2025 at 9:29 AM, Bakul Shah via TUHS > wrote: > > C's Achilles' heel is its type system. No extension can paper over > > that. Once you fix that it is no longer C. Newer languages can in > > fact be seen as fixing/adding things like proper strings, default > > read/write rules, memory allocation rules, modules/packages, > > object/closures, etc. over C. This is why I said they are not > > particularly outstanding. A lot of it seems rather adhoc. > > For me though that speaks to the fact that C leaves a lot to the > imagination. You've got the building blocks of a number of higher > level concepts, but implementation thereof is left to the user. Of > course you have these various extensions pop up, but the base language > does not define the "one true way" to do it, rather, each case can be > engineered with context in mind. If I'm understanding Bakul's point correctly, then what you're saying is a defensible claim about many aspects of C but emphatically _not_ of its type system. The language does _not_ give you the tools to implement a rigorous type system as, Haskell programmers or type theorists would conceive of it. I mean, you could write a Haskell compiler in C, but you wouldn't be able to apply its type logic to C's own type system. (Maybe you could do it for any types other than the primitive ones already supported, with sufficient abuse of the preprocessor.) This rigid inability was baked into C from day one. Everything that can possibly be represented in a machine register with all bits clear shows up as an integral zero literal. '\0' == 0 == nullptr == struct { } == union { } The CSRC staked a lot on the typeless B, and clung to that heritage. I don't think it's an accident that C's designers expressed type _aliasing_ with the keyword "typedef". Types simply were not taken seriously as reasoning tools. C was a language for people who wanted to get that crap out of the way so they could think about binary representations. (Okay, octal.) C was aggressively oversold, I think more by by its fans than its designers. Even its reputation as "portable assembler" withstands scrutiny only as long as all you want to port to are models in the PDP-11 line. If you think that claim is crack-headed, review London & Reiser's paper on porting Unix to the VAX-11/780. Even setting aside the problems with nroff/troff and the Bourne shell, they pointed their fingers at problems that a "portable assembly language" would already have considered, like machines having differing alignment requirements, or having the standard I/O library's binary I/O functions write and interpret some kind of header in the file expressing basic facts like the endianness of various integer widths (big, little, or FP-11). Magic numbers were good enough for object file formats, but not applications programmers, I guess. I don't doubt that people could be massively more productive in C than in PDP-11 assembly language. But that claim doesn't establish anything that wasn't already known about the utility of higher-level languages. K&R were still trying to warn people in 1988 that C was not a hammer for all nails. "...C offers only straightforward, single-thread control flow: tests, loops, grouping, and subprograms, but not multiprogramming, parallel operations, synchronization, or coroutines." (_The C Programming Language, 2nd edition, p. 2) I guess we can add a general, flexible algebraic type system to the list of missing features. Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From rich.salz at gmail.com Mon Mar 3 05:09:37 2025 From: rich.salz at gmail.com (Rich Salz) Date: Sun, 2 Mar 2025 14:09:37 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302154623.GG13234@mcvoy.com> References: <20250301220111.GB13234@mcvoy.com> <20250302011740.GE31995@mcvoy.com> <20250302020737.aw6ef53l6h63pmew@illithid> <20250302154623.GG13234@mcvoy.com> Message-ID: On Sun, Mar 2, 2025 at 10:46 AM Larry McVoy wrote: > Hanging out there gave me insight into what all > those execs did. It's very different from what an engineer does, we > figure stuff out to the Nth degree. Very nice note. -------------- next part -------------- An HTML attachment was scrubbed... URL: From woods at robohack.ca Mon Mar 3 06:51:26 2025 From: woods at robohack.ca (Greg A. Woods) Date: Sun, 02 Mar 2025 12:51:26 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> Message-ID: At Sat, 1 Mar 2025 19:00:03 -0800, Bakul Shah via TUHS wrote: Subject: [TUHS] Re: Any UNIX With No C In Userland? > > .... C is a great portable > assembler but not so great as a high level language. Unfortunately modern C isn't a a great portable "assembler" any more. V is probably one of the better new low-level programming languages. Development of V has hardly stalled -- it's highly active from what I see on Github. I personally think V is very easy to learn for C programmers, with lots of similarities, and it's very easy to translate C to V too! Hare is another interesting new compiled systems-level language. Both are promising "No Undefined Behaviour" -- since of course the main Sin Of Modern C is it's all-too-abusive reliance on Undefined Behaviour. Possibly Jai will be in this realm as well. Hmm.... Nim? Maybe. Regarding the question starting this part of the thread: Any UNIX with no C In Userland? Well there was TUNIS (https://en.wikipedia.org/wiki/TUNIS). I don't have quick access to my copy of the book about it, but I seem to recall that there may have been some attempts to be ABI-compatible with 7th Edition Unix, at least on PDP-11. That Wikipedia article does say: Programs that ran under Unix V7 could be run under TUNIS with no modification. But that may have only meant API-compatible. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From douglas.mcilroy at dartmouth.edu Mon Mar 3 07:45:40 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Sun, 2 Mar 2025 16:45:40 -0500 Subject: [TUHS] Language-lawyers' field day (was Any UNIX With No C In Userland?) Message-ID: > Everything that can possibly be represented in a machine > register with all bits clear shows up as an integral zero literal. > '\0' == 0 == nullptr == struct { } == union { } Well, some things. 0.0f and other floating-point zero constants are represented by all-zero words (of various sizes) and are not integral constants. NULL does not "show up as an integral zero literal". 0==NULL is true only because 0 can be converted to NULL. Getting really lawyerly, one can cook up any number of bizarre "things that can possibly be represented" by an all-zero word, for example (char[4]){0,0,0,0}, and have no representation as an integral constant. Only 3 of the 5 examples fit the description of possibly being represented by an all-zero word. struct{} and union{} are gnu extensions with size zero. Even if you accept them as C, they have no machine representation and cannot be cast to int. The null pointer makes the list only thanks to the weasel-word "possibly". Although 0 can be cast to the null pointer, the result of casting a null pointer to int depends on its unspecified machine representation. Zero, of course, is a good choice because it's easy to test for, and is easy to omit from virtual address spaces. Doug From kevin.bowling at kev009.com Mon Mar 3 08:43:06 2025 From: kevin.bowling at kev009.com (Kevin Bowling) Date: Sun, 2 Mar 2025 15:43:06 -0700 Subject: [TUHS] AIX (RT/PC & RS/6000) 1/2/3 compilers [was Re: Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: On Sat, Mar 1, 2025 at 5:11 PM Charles H Sauer (he/him) wrote: > > > > On 3/1/2025 5:46 PM, segaloco via TUHS wrote: > > > > > But I digress, I seem to recall reading in another thread here that AIX may have had a fair deal of IBM stuff like perhaps some PL/I or PL/S down in the guts of significant chunks at one point, but I couldn't speak to that with any authority. I could see IBM what with their legacy in languages bristling at letting C be the star of the show. > > > > - Matt G. > > For all our faults in the AIX team and IBM in general, there was no > desire to have legacy IBM languages as primary in AIX. > > https://notes.technologists.com/notes/2017/03/08/lets-start-at-the-very-beginning-801-romp-rtpc-aix-versions/ > tries to explain why PL.8 code existed in early AIX and how PL.8 code > was eliminated in AIX 3 development. > > AIX for the RT 1&2 bundled pcc (with the HCR optimization phase in AIX 2). > > Since I left IBM in the midst of AIX 3 development, I'm not certain what > happened with compilers after I left. Part of the confusion was IBM > Toronto rewriting the Yorktown Research compiler to be a "product > worthy" C compiler. There may have been desire to gain revenue for the > Toronto compiler, but I assume that some some C compiler was bundled in > AIX 3. AIX 3 still shipped with a compiler (see, for instance the N40 media https://www.ardent-tool.com/RS6000/7007/AIX.html) although I've never bothered to see if it was separately licensed or not (there is no key/drm). AIX 4 divorced the compiler from base went under a few names over the years: C fo AIX, xl C/C++, VisualAge C/C++. Lately things were rebased onto a clang frontend and called Open XL. The VisualAge one is kind of interesting because it includes the VisualAge SmallTalk derived IDE which eventually spawned the Eclipse IDE project. Slightly less clunky without the Java. > Charlie > > -- > voice: +1.512.784.7526 e-mail: sauer at technologists.com > fax: +1.512.346.5240 Web: https://technologists.com/sauer/ > Facebook/Google/LinkedIn/mas.to: CharlesHSauer > From bear at typewritten.org Mon Mar 3 09:03:09 2025 From: bear at typewritten.org (r.stricklin) Date: Sun, 2 Mar 2025 15:03:09 -0800 Subject: [TUHS] AIX (RT/PC & RS/6000) 1/2/3 compilers [was Re: Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <2241C6AE-8385-4864-AAAD-517CA297E524@typewritten.org> > On Mar 2, 2025, at 2:43 PM, Kevin Bowling wrote: > > AIX 3 still shipped with a compiler (see, for instance the N40 media > https://www.ardent-tool.com/RS6000/7007/AIX.html ) although I've never > bothered to see if it was separately licensed or not (there is no > key/drm). The N40 CD media may have included the compiler, but it also included a lot of other unbundled software features (e.g. AIXwindows, fax support, LocalTalk, WABI), so I don’t think this should be the benchmark. I’ve personally catalogged many sets of AIX 3 install tapes with only the various language runtime(s), and no compiler present. The tapes were generated for each customer and included only the features that were ordered. ok bear. -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Mon Mar 3 09:25:39 2025 From: crossd at gmail.com (Dan Cross) Date: Sun, 2 Mar 2025 18:25:39 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250302183905.ibxuwqyvj4pnrihy@illithid> References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: On Sun, Mar 2, 2025 at 1:46 PM G. Branden Robinson wrote: > At 2025-03-02T17:54:01+0000, segaloco via TUHS wrote: >[snip] > If I'm understanding Bakul's point correctly, then what you're saying is > a defensible claim about many aspects of C but emphatically _not_ of its > type system. The language does _not_ give you the tools to implement a > rigorous type system as, Haskell programmers or type theorists would > conceive of it. I mean, you could write a Haskell compiler in C, but > you wouldn't be able to apply its type logic to C's own type system. > (Maybe you could do it for any types other than the primitive ones > already supported, with sufficient abuse of the preprocessor.) > > This rigid inability was baked into C from day one. Everything that can > possibly be represented in a machine register with all bits clear shows > up as an integral zero literal. > > '\0' == 0 == nullptr == struct { } == union { } > > The CSRC staked a lot on the typeless B, and clung to that heritage. > I don't think it's an accident that C's designers expressed type > _aliasing_ with the keyword "typedef". Types simply were not taken > seriously as reasoning tools. C was a language for people who wanted to > get that crap out of the way so they could think about binary > representations. (Okay, octal.) I'm sure I've said something like this before, but I do not think this criticism is fair. Many of the design aspects of C were dictated by the context in which it was developed, and if judging it in 2025, we have to bear that in mind. Recall that it had to fit on a machine that is laughably small by today's standards, and pretty small even in 1972; that imposed real constraints on its structure and semantics. Did other systems-oriented languages in that era have significantly richer type systems? Would a compiler for such a language even be runnable on Unix in the very early 70s? That's a serious question. Incidentally, the keyword for creating a type alias in Haskell is just `type`. Same in SML, OCaml, and Rust. I don't know that `typedef` is appreciably different, nor that it can credibly be cited as some sort of "tell" about a cavalier attitude towards types. > C was aggressively oversold, I think more by by its fans than its > designers. For the sorts of things one used C for throughout the 70s and into the 80s, what were the alternatives? > Even its reputation as "portable assembler" withstands > scrutiny only as long as all you want to port to are models in the > PDP-11 line. If you think that claim is crack-headed, review London & > Reiser's paper on porting Unix to the VAX-11/780. Even setting aside > the problems with nroff/troff and the Bourne shell, they pointed their > fingers at problems that a "portable assembly language" would already > have considered, like machines having differing alignment requirements, I didn't read that in their paper. They pointed out four enhancements to C, related to types, that they felt would enhance portability. Interestingly, all four have been implemented in more modern dialects of the language, though the alignment thing is still a bit perilous; I can't quite tell whether they were referring to "packed" structures (largely irrelevant when programs are written against a well-defined ABI) or something closer to `alignas()`/`alignof()`. They did indicate that alignment makes sharing _binary_ data between VAX and PDP-11 harder, but that's true of other aspects of product types as well. > or having the standard I/O library's binary I/O functions write and > interpret some kind of header in the file expressing basic facts like > the endianness of various integer widths (big, little, or FP-11). Magic > numbers were good enough for object file formats, but not applications > programmers, I guess. This really doesn't seem like it's the job of the IO library. For that matter, I don't think more recent languages delegate this to their IO routines, either. I'm also not sure that's what they said (at least, that's not how I interpreted their comments). > I don't doubt that people could be massively more productive in C than > in PDP-11 assembly language. But that claim doesn't establish anything > that wasn't already known about the utility of higher-level languages. > > K&R were still trying to warn people in 1988 that C was not a hammer for > all nails. > > "...C offers only straightforward, single-thread control flow: tests, > loops, grouping, and subprograms, but not multiprogramming, parallel > operations, synchronization, or coroutines." (_The C Programming > Language, 2nd edition, p. 2) > > I guess we can add a general, flexible algebraic type system to the list > of missing features. Sure. But then we're back where we started, talking about a language that only has a passing resemblance to C. - Dan C. From gingell at computer.org Mon Mar 3 18:34:08 2025 From: gingell at computer.org (Rob Gingell) Date: Mon, 3 Mar 2025 00:34:08 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250301233805.GC13234@mcvoy.com> References: <20250301233805.GC13234@mcvoy.com> Message-ID: <3ea0b8a2-de64-4b15-9b0e-c497af27fd50@computer.org> On 3/1/25 3:38 PM, Larry McVoy wrote: > Rob Gingell would likely know. Time has fogged the memories but I'll try. > On Sat, Mar 01, 2025 at 06:28:58PM -0500, Clem Cole wrote: >> IIRC: Sun continued to bundled a simple C compiler so you build the kernel, >> but it was trying to make $s on the compiler suite. >> >> SW economics can be difficult. As has been noted, SunOS 4.x continued to ship a cc sufficient to build the kernel but which didn't otherwise track the language evolution going on. The unbundled tools were introduced to deliver those and, in a theory about the value of software, to earn the necessary investments by generating a return. It wasn't just marketing being revenue vampires though it presented an opportunity for feeding. There was a good bit of investment / change in the tools in this period. The SVR4 work brought the ANSI C front end from AT&T, FORTRAN technology purchases/licenses occurred, there was something going on with Pascal that I don't recall at the moment. Some of the investment was necessitated by the shift to SPARC and the demands that RISC posed on the code generation software. The appeal that these things must be done because they're necessary to have good and credible products was countered with a business sentiment that the value should manifest as a return visible in $. So I recall it as more textured than just "let's squeeze some $'s" but especially if you have the view that "C is part of the system" it's an affront. On 3/1/25 5:17 PM, Larry McVoy wrote: > The funny thing was that somewhere around then Sun Labs was paying Micheal > Tiemann to make g++ work. With a deal that let him retain the rights to > the code. I never understood that, one hand wants to charge for cc and > the other hand is paying for free g++? The ways of Sun could be strange. One of Sun's charms was a "let many flowers bloom, even if they contradict themselves" kind of culture. Sometimes this yielded successes and opportunities. Sometimes it resulted in frictions and periodic culls. (The "all the wood behind one arrow" SPARC focus was one such consequence, when the array of 68K, 386, and SPARC products got too logistically cacophonous and Sun blew a revenue projection in [I think] the last FY1989 quarter. The cacophony wasn't in engineering so much as the rest of the business: sales forecasting, manufacturing, support, ISV relationships. etc.) "Many flowers bloom" thinking wasn't restricted to just technology and extended to business models. The thinking that "investments should follow the revenue" visible in the tools unbundling along with additional influences led to the "planets" reorganization of Sun (e.g., SunSoft, SunPro, etc.) where inter-unit pricing and therefore revenues would demonstrate where value was derived. Sun's R&D levels were always higher than conventional business wisdom said they should be so rationalizing them drove such thinking. Whatever the merits of the planets model in theory were, they were completely lost in the execution. The implementation looked more like the State Planning Committee of the Soviet Union than Wall Street. The company focused inward rather than outward to the market for a while before the whole thing was relaxed back to something more "normal". Definitely one of the worst self-inflicted wounds in the company's history. IMO, such failures emerged from the same environment that permitted the exploration of unusual ideas and the taking of big swings. Sun was probably better off and certainly more impactful, fun, and interesting for being that way even at the cost of some dramatic face plants. From peter.martin.yardley at gmail.com Mon Mar 3 21:31:13 2025 From: peter.martin.yardley at gmail.com (Peter Yardley) Date: Mon, 3 Mar 2025 22:31:13 +1100 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: Problems I have with C as a systems language is there is no certainty about representation of a structure in memory and hence when it is written to disk. I will be happy to be corrected but I remember this behaviour to be compiler dependant. Other languages such as Bliss and to perhaps a lesser degree Pascal had implicit control of this. Having said that I worked with C as a systems language many times. > On 3 Mar 2025, at 10:25 am, Dan Cross wrote: > > On Sun, Mar 2, 2025 at 1:46 PM G. Branden Robinson > wrote: >> At 2025-03-02T17:54:01+0000, segaloco via TUHS wrote: >> [snip] >> If I'm understanding Bakul's point correctly, then what you're saying is >> a defensible claim about many aspects of C but emphatically _not_ of its >> type system. The language does _not_ give you the tools to implement a >> rigorous type system as, Haskell programmers or type theorists would >> conceive of it. I mean, you could write a Haskell compiler in C, but >> you wouldn't be able to apply its type logic to C's own type system. >> (Maybe you could do it for any types other than the primitive ones >> already supported, with sufficient abuse of the preprocessor.) >> >> This rigid inability was baked into C from day one. Everything that can >> possibly be represented in a machine register with all bits clear shows >> up as an integral zero literal. >> >> '\0' == 0 == nullptr == struct { } == union { } >> >> The CSRC staked a lot on the typeless B, and clung to that heritage. >> I don't think it's an accident that C's designers expressed type >> _aliasing_ with the keyword "typedef". Types simply were not taken >> seriously as reasoning tools. C was a language for people who wanted to >> get that crap out of the way so they could think about binary >> representations. (Okay, octal.) > > I'm sure I've said something like this before, but I do not think this > criticism is fair. Many of the design aspects of C were dictated by > the context in which it was developed, and if judging it in 2025, we > have to bear that in mind. Recall that it had to fit on a machine that > is laughably small by today's standards, and pretty small even in > 1972; that imposed real constraints on its structure and semantics. > Did other systems-oriented languages in that era have significantly > richer type systems? Would a compiler for such a language even be > runnable on Unix in the very early 70s? That's a serious question. > Incidentally, the keyword for creating a type alias in Haskell is just > `type`. Same in SML, OCaml, and Rust. I don't know that `typedef` is > appreciably different, nor that it can credibly be cited as some sort > of "tell" about a cavalier attitude towards types. > >> C was aggressively oversold, I think more by by its fans than its >> designers. > > For the sorts of things one used C for throughout the 70s and into the > 80s, what were the alternatives? > >> Even its reputation as "portable assembler" withstands >> scrutiny only as long as all you want to port to are models in the >> PDP-11 line. If you think that claim is crack-headed, review London & >> Reiser's paper on porting Unix to the VAX-11/780. Even setting aside >> the problems with nroff/troff and the Bourne shell, they pointed their >> fingers at problems that a "portable assembly language" would already >> have considered, like machines having differing alignment requirements, > > I didn't read that in their paper. They pointed out four enhancements > to C, related to types, that they felt would enhance portability. > Interestingly, all four have been implemented in more modern dialects > of the language, though the alignment thing is still a bit perilous; I > can't quite tell whether they were referring to "packed" structures > (largely irrelevant when programs are written against a well-defined > ABI) or something closer to `alignas()`/`alignof()`. They did indicate > that alignment makes sharing _binary_ data between VAX and PDP-11 > harder, but that's true of other aspects of product types as well. > >> or having the standard I/O library's binary I/O functions write and >> interpret some kind of header in the file expressing basic facts like >> the endianness of various integer widths (big, little, or FP-11). Magic >> numbers were good enough for object file formats, but not applications >> programmers, I guess. > > This really doesn't seem like it's the job of the IO library. For that > matter, I don't think more recent languages delegate this to their IO > routines, either. I'm also not sure that's what they said (at least, > that's not how I interpreted their comments). > >> I don't doubt that people could be massively more productive in C than >> in PDP-11 assembly language. But that claim doesn't establish anything >> that wasn't already known about the utility of higher-level languages. >> >> K&R were still trying to warn people in 1988 that C was not a hammer for >> all nails. >> >> "...C offers only straightforward, single-thread control flow: tests, >> loops, grouping, and subprograms, but not multiprogramming, parallel >> operations, synchronization, or coroutines." (_The C Programming >> Language, 2nd edition, p. 2) >> >> I guess we can add a general, flexible algebraic type system to the list >> of missing features. > > Sure. But then we're back where we started, talking about a language > that only has a passing resemblance to C. > > - Dan C. .1.3.6.1.4.1.8852.4.2 Peter Yardley peter.martin.yardley at gmail.com From paul.winalski at gmail.com Tue Mar 4 02:59:42 2025 From: paul.winalski at gmail.com (Paul Winalski) Date: Mon, 3 Mar 2025 11:59:42 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: On Mon, Mar 3, 2025 at 6:51 AM Peter Yardley wrote: > Problems I have with C as a systems language is there is no certainty > about representation of a structure in memory and hence when it is written > to disk. I believe you are correct. Alignment and padding within structs in C is implementation-dependent. > I will be happy to be corrected but I remember this behaviour to be > compiler dependant. Other languages such as Bliss and to perhaps a lesser > degree Pascal had implicit control of this. > > In the case of BLISS, I would call it explicit control. BLISS is a peculiar language in that it has only one data type, the BLISS value which is a contiguous set of bits of fixed length, typically the size of the target machine word. There are four dialects of BLISS that differed mainly in the size of the BLISS value: 16 for BLISS-16 (for the PDP-11, 32 for BLISS-32 (VAX, x86), 36 for BLISS-36 (PDP-10), and 64 for BLISS-64 (Alpha, Itanium, x86-64). Data typing in BLISS is a function of the operations performed on the BLISS value rather than a property of the value itself. BLISS is also unusual in that data are retrieved from memory or registers using an explicit fetch operator. There is no distinction between lvalues and rvalues. BLISS is also an expression language rather than a statement language. Everything, including procedural code, is an expression with a BLISS value and can be used as such. For example: a = (if .b then .c else .d) This code fetches the BLISS value whose address is b ('.' is the fetch operator). The IF expression treats this fetched value as a logical true/false. If the value is true, the value whose address is c is fetched, otherwise the value whose address is d is fetched. The value of IF expression is the fetched value. The = expression then stores that value at the address represented by a. The stored value is also the value of the = expression itself. The semicolon (;) is an expression separator with the semantics "discard the current expression value". It is common practice to write procedural code with operations separated by semicolons, which gives you a syntax similar to statement-oriented languages. But you don't have to code that way. BLISS does have data aggregates, both vectors (one-dimensional arrays of BLISS values) and C-style structs. The latter allows one to attach identifier names to pieces of the aggregate, supplying the same information as with the fetch operator (starting bit within the BLISS value, length in bits, whether or not it's to be sign-extended). In C the programmer merely specifies the order of the fields within a struct and their data types. The compiler actually lays out the positions of the fields. In BLISS the programmer must explicitly position the fields and specify their lengths. -------------- next part -------------- An HTML attachment was scrubbed... URL: From stewart at serissa.com Tue Mar 4 03:38:33 2025 From: stewart at serissa.com (Lawrence Stewart) Date: Mon, 3 Mar 2025 12:38:33 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> A bit of a story. At SiCortex, we were well aware that structure layout and bitfields were compiler dependent, yet we had real requirements that high (?) level languages be able to directly interact with hardware defined structures. So there was a chip spec that was the master truth about register layouts and hardware structures. Wilson Snyder (of Verilator fame) wrote a postprocessor that swallowed the chip spec document and spit out header files in various languages, one of which was C Preprocessor compatible #defines for field offset, position, and width. We programmers were careful to always use GET and SET macros to access objects with hardware specified layouts. IIRC there were additional macros to make sure that hardware accesses always used the proper machine instructions and fences. I don’t think the use of these macros cost much or any performance because they mostly resolved into shifts and masks with constants. It’s too bad that that C doesn’t have standardized rules for structures, because even when hardware isn’t involved, one must be extremely careful with cacheline alignment of fields in order to get good performance. __attribute__((aligned(64)) for the win, but it is ugly. I don’t think there is even a portable way to get the cacheline size. Then there is compiler register promotion, and volatile. -L > On Mar 3, 2025, at 11:59, Paul Winalski wrote: > > On Mon, Mar 3, 2025 at 6:51 AM Peter Yardley > wrote: >> Problems I have with C as a systems language is there is no certainty about representation of a structure in memory and hence when it is written to disk. > > I believe you are correct. Alignment and padding within structs in C is implementation-dependent. > >> I will be happy to be corrected but I remember this behaviour to be compiler dependant. Other languages such as Bliss and to perhaps a lesser degree Pascal had implicit control of this. >> > In the case of BLISS, I would call it explicit control. BLISS is a peculiar language in that it has only one data type, the BLISS value which is a contiguous set of bits of fixed length, typically the size of the target machine word. There are four dialects of BLISS that differed mainly in the size of the BLISS value: 16 for BLISS-16 (for the PDP-11, 32 for BLISS-32 (VAX, x86), 36 for BLISS-36 (PDP-10), and 64 for BLISS-64 (Alpha, Itanium, x86-64). > > Data typing in BLISS is a function of the operations performed on the BLISS value rather than a property of the value itself. BLISS is also unusual in that data are retrieved from memory or registers using an explicit fetch operator. There is no distinction between lvalues and rvalues. BLISS is also an expression language rather than a statement language. Everything, including procedural code, is an expression with a BLISS value and can be used as such. For example: > > a = (if .b then .c else .d) > > This code fetches the BLISS value whose address is b ('.' is the fetch operator). The IF expression treats this fetched value as a logical true/false. If the value is true, the value whose address is c is fetched, otherwise the value whose address is d is fetched. The value of IF expression is the fetched value. The = expression then stores that value at the address represented by a. The stored value is also the value of the = expression itself. The semicolon (;) is an expression separator with the semantics "discard the current expression value". It is common practice to write procedural code with operations separated by semicolons, which gives you a syntax similar to statement-oriented languages. But you don't have to code that way. > > BLISS does have data aggregates, both vectors (one-dimensional arrays of BLISS values) and C-style structs. The latter allows one to attach identifier names to pieces of the aggregate, supplying the same information as with the fetch operator (starting bit within the BLISS value, length in bits, whether or not it's to be sign-extended). In C the programmer merely specifies the order of the fields within a struct and their data types. The compiler actually lays out the positions of the fields. In BLISS the programmer must explicitly position the fields and specify their lengths. -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Tue Mar 4 03:55:10 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Mon, 03 Mar 2025 17:55:10 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> Message-ID: On Monday, March 3rd, 2025 at 9:38 AM, Lawrence Stewart wrote: > ... > It’s too bad that that C doesn’t have standardized rules for structures, because even when hardware isn’t involved, one must be extremely careful with cacheline alignment of fields in order to get good performance.  __attribute__((aligned(64)) for the win, but it is ugly.  I don’t think there is even a portable way to get the cacheline size. > > ... > > -L > Truth be told the subjectivity of implementing struct memory characteristics has bewildered me more rather than less as time goes on. What with the UNIX kernel being one of C's earliest applications, you find plenty of places where a struct is used to directly represent for instance a set of I/O registers on a peripheral (UART comes to mind). In this case, this would pretty much necessitate a 100% predictable layout of a structure in memory, otherwise you have no guarantee that the struct members you assign to specific registers (THR, SR, etc.) will always be aligned with those registers. This is an area where the "systems language" aspect does break down a bit broadly for me, if I can't describe a contiguous memory region in a structured way, just describe an abstract structure that various compilers put together however they see fit, this makes it much more difficult to use structures for anything other than abstract, in memory data in the context of a given process (assuming all code in that process comes from the same compiler). To effectively transfer data between systems you then need not only worry about endianness, but the ordering itself. Sure, UNIX was all about everything being text as much as possible, but there's something to be said about being able to avoid the cost of serialization of data when you know endianness isn't a problem. When compilers are free to clobber structures together in memory haphazardly, even sharing a CPU isn't a guarantee binary data will be portable. Of course not trying to lob unfair critique at the folks who had to make these decisions when constructing compilers, but it just surprises me there wasn't more impetus to ensure a structure represents not only the members but the memory characteristics of the data as it is being stored. That said, given that datatypes generally have minimum limits but are not *exact* across every system, there is the fact that using the same datatypes across different CPUs, even if the structures were very strict about no padding, order, whatever, you'd still have the potential for wildly different memory layouts on different systems. Not to mention systems requiring specific alignments regardless of data type. Maybe I've answered my own concern but I still find myself curious why there wasn't more of a push for memory predictability in a language being pushed as a systems language. It winds up being a happy accident, rather than a matter of intention, when extension-less ANSI C fits nicely over a set of hardware I/O registers... - Matt G. P.S. I will acquiesce that deeply embedded applications where memory details are quite important typically are not locked into one compiler ecosystem other than through vendor decisions, so it is much more feasible to leverage some specific extended C (GNU, llvm, etc.) for bare-metal projects where I/O register pages and such being perfectly mapped in structs matters. From lm at mcvoy.com Tue Mar 4 04:27:57 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 3 Mar 2025 10:27:57 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> Message-ID: <20250303182757.GR13234@mcvoy.com> On Mon, Mar 03, 2025 at 05:55:10PM +0000, segaloco via TUHS wrote: > Truth be told the subjectivity of implementing struct memory characteristics has > bewildered me more rather than less as time goes on. Alignment is your answer. Understand that and the confusion goes away: slovax ~/tmp cat pack.c #include struct { char a; int b; } foo; int main(void) { printf("%lu\n", sizeof(foo)); return (0); } slovax ~/tmp cc pack.c slovax ~/tmp a.out 8 Even x86, it would appear, wants to do aligned loads. I'm a little surprised by that though maybe I shouldn't be as there is a RISC implemented by the microcode under the x86 CPU. Does anyone know if gcc has an option to ignore alignment and pack the structs? From imp at bsdimp.com Tue Mar 4 04:29:59 2025 From: imp at bsdimp.com (Warner Losh) Date: Mon, 3 Mar 2025 11:29:59 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: On Mon, Mar 3, 2025, 10:00 AM Paul Winalski wrote: > On Mon, Mar 3, 2025 at 6:51 AM Peter Yardley < > peter.martin.yardley at gmail.com> wrote: > >> Problems I have with C as a systems language is there is no certainty >> about representation of a structure in memory and hence when it is written >> to dis5⅝k. > > > I believe you are correct. Alignment and padding within structs in C is > implementation-dependent. > Sizeof(int) is compiler dependent. uint32_t even isn't always 4 bytes. There's no way to say "packed" that is portable. Endian isn't specified so you have to cope with that. And bit field order is different for big/little Endian compiler modes, but that detail is just super common. It isn't mandated. And pointers aren't valid run to run And that's even before you try to save a string... it's a mess that have a lot of solutions to serialize... Warner > I will be happy to be corrected but I remember this behaviour to be >> compiler dependant. Other languages such as Bliss and to perhaps a lesser >> degree Pascal had implicit control of this. >> >> In the case of BLISS, I would call it explicit control. BLISS is a > peculiar language in that it has only one data type, the BLISS value which > is a contiguous set of bits of fixed length, typically the size of the > target machine word. There are four dialects of BLISS that differed mainly > in the size of the BLISS value: 16 for BLISS-16 (for the PDP-11, 32 for > BLISS-32 (VAX, x86), 36 for BLISS-36 (PDP-10), and 64 for BLISS-64 (Alpha, > Itanium, x86-64). > > Data typing in BLISS is a function of the operations performed on the > BLISS value rather than a property of the value itself. BLISS is also > unusual in that data are retrieved from memory or registers using an > explicit fetch operator. There is no distinction between lvalues and > rvalues. BLISS is also an expression language rather than a statement > language. Everything, including procedural code, is an expression with a > BLISS value and can be used as such. For example: > > a = (if .b then .c else .d) > > This code fetches the BLISS value whose address is b ('.' is the fetch > operator). The IF expression treats this fetched value as a logical > true/false. If the value is true, the value whose address is c is fetched, > otherwise the value whose address is d is fetched. The value of IF > expression is the fetched value. The = expression then stores that value > at the address represented by a. The stored value is also the value of the > = expression itself. The semicolon (;) is an expression separator with the > semantics "discard the current expression value". It is common practice to > write procedural code with operations separated by semicolons, which gives > you a syntax similar to statement-oriented languages. But you don't have > to code that way. > > BLISS does have data aggregates, both vectors (one-dimensional arrays of > BLISS values) and C-style structs. The latter allows one to attach > identifier names to pieces of the aggregate, supplying the same information > as with the fetch operator (starting bit within the BLISS value, length in > bits, whether or not it's to be sign-extended). In C the programmer merely > specifies the order of the fields within a struct and their data types. > The compiler actually lays out the positions of the fields. In BLISS the > programmer must explicitly position the fields and specify their lengths. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From imp at bsdimp.com Tue Mar 4 04:31:20 2025 From: imp at bsdimp.com (Warner Losh) Date: Mon, 3 Mar 2025 11:31:20 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250303182757.GR13234@mcvoy.com> References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: On Mon, Mar 3, 2025, 11:28 AM Larry McVoy wrote: > On Mon, Mar 03, 2025 at 05:55:10PM +0000, segaloco via TUHS wrote: > > Truth be told the subjectivity of implementing struct memory > characteristics has > > bewildered me more rather than less as time goes on. > > Alignment is your answer. Understand that and the confusion goes away: > > slovax ~/tmp cat pack.c > #include > > struct { > char a; > int b; > } foo; > > int > main(void) > { > printf("%lu\n", sizeof(foo)); > return (0); > } > > slovax ~/tmp cc pack.c > slovax ~/tmp a.out > 8 > > > Even x86, it would appear, wants to do aligned loads. I'm a little > surprised by that though maybe I shouldn't be as there is a RISC > implemented by the microcode under the x86 CPU. > > Does anyone know if gcc has an option to ignore alignment and pack the > structs? > __attribute__ ((__packed__)) Warner > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steffen at sdaoden.eu Tue Mar 4 04:45:21 2025 From: steffen at sdaoden.eu (Steffen Nurpmeso) Date: Mon, 03 Mar 2025 19:45:21 +0100 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: <20250303184521.QKG9ks8K@steffen%sdaoden.eu> Warner Losh wrote in : |On Mon, Mar 3, 2025, 11:28 AM Larry McVoy wrote: |> On Mon, Mar 03, 2025 at 05:55:10PM +0000, segaloco via TUHS wrote: |>> Truth be told the subjectivity of implementing struct memory |> characteristics has |>> bewildered me more rather than less as time goes on. |> |> Alignment is your answer. Understand that and the confusion goes away: |> |> slovax ~/tmp cat pack.c |> #include |> |> struct { |> char a; |> int b; |>} foo; |> |> int |> main(void) |> { |> printf("%lu\n", sizeof(foo)); |> return (0); |>} |> |> slovax ~/tmp cc pack.c |> slovax ~/tmp a.out |> 8 |> |> |> Even x86, it would appear, wants to do aligned loads. I'm a little |> surprised by that though maybe I shouldn't be as there is a RISC |> implemented by the microcode under the x86 CPU. |> |> Does anyone know if gcc has an option to ignore alignment and pack the |> structs? |> | |__attribute__ ((__packed__)) __attribute__((packed, aligned(1))) I have forgotten why both. --steffen | |Der Kragenbaer, The moon bear, |der holt sich munter he cheerfully and one by one |einen nach dem anderen runter wa.ks himself off |(By Robert Gernhardt) From crossd at gmail.com Tue Mar 4 04:50:07 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 3 Mar 2025 13:50:07 -0500 Subject: [TUHS] Compilers (was Re: Any UNIX With No C In Userland?) In-Reply-To: References: Message-ID: On Sun, Mar 2, 2025 at 11:51 AM Doug McIntyre wrote: > On Sat, Mar 01, 2025 at 06:46:46PM -0500, Dan Cross wrote: > > On Sat, Mar 1, 2025 at 6:37 PM Warner Losh wrote: > >> Yes. I think so. But I also think that said C compiler wasn't adequate to bootstrap gcc or that there were extra steps / workarounds needed to do that. This was during the K&R -> ANSI cutover that Sun did this as well, and the old compiler was definitely K&R only. > > > > It's been years but I remember bootstrapping gcc with the `cc` that > > came with SunOS (was it in /usr/ucb? For some reason that sticks in my > > memory). I don't recall additional workaround steps, though perhaps > > one had to install other GNU utilities? > > > In that time period, the gcc distribution had a special bootstrap > version (older version?) of itself that was capable specificly of > being able to be compiled by the cc that SunOS shipped with that was > intended to compile the kernel bits shipped to configure in new limits/drivers, etc. > > The first stages of getting gcc compiled on SunOS was compiling that bootstrap > version of itself, and then using that bootstraped special version of gcc to then compile > the regular/up-to-date version of gcc. > > Eventually after some gcc major version, it stopped supporting/bundling this special bootstrap > version, and dropped support for bootstrapping itself up on SunOS. Huh. I totally believe you, but I have no memory of this. Perhaps I blanked it out; it does sound rather traumatic. - Dan C. From lars at nocrew.org Tue Mar 4 04:52:31 2025 From: lars at nocrew.org (Lars Brinkhoff) Date: Mon, 03 Mar 2025 18:52:31 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: (Paul Winalski's message of "Mon, 3 Mar 2025 11:59:42 -0500") References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: <7wfrjuj6a8.fsf@junk.nocrew.org> Paul Winalski wrote: > There are four dialects of BLISS that differed mainly in the size of > the BLISS value: 16 for BLISS-16 (for the PDP-11, 32 for BLISS-32 > (VAX, x86), 36 for BLISS-36 (PDP-10), and 64 for BLISS-64 (Alpha, > Itanium, x86-64). There was also BLISS-10, an earlier version for the PDP-10, and BLISS-11, a cross compiler targeting the PDP-11, hosted on a PDP-10. From imp at bsdimp.com Tue Mar 4 05:45:50 2025 From: imp at bsdimp.com (Warner Losh) Date: Mon, 3 Mar 2025 12:45:50 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250303184521.QKG9ks8K@steffen%sdaoden.eu> References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> <20250303184521.QKG9ks8K@steffen%sdaoden.eu> Message-ID: On Mon, Mar 3, 2025 at 11:45 AM Steffen Nurpmeso wrote: > Warner Losh wrote in > : > |On Mon, Mar 3, 2025, 11:28 AM Larry McVoy wrote: > |> On Mon, Mar 03, 2025 at 05:55:10PM +0000, segaloco via TUHS wrote: > |>> Truth be told the subjectivity of implementing struct memory > |> characteristics has > |>> bewildered me more rather than less as time goes on. > |> > |> Alignment is your answer. Understand that and the confusion goes away: > |> > |> slovax ~/tmp cat pack.c > |> #include > |> > |> struct { > |> char a; > |> int b; > |>} foo; > |> > |> int > |> main(void) > |> { > |> printf("%lu\n", sizeof(foo)); > |> return (0); > |>} > |> > |> slovax ~/tmp cc pack.c > |> slovax ~/tmp a.out > |> 8 > |> > |> > |> Even x86, it would appear, wants to do aligned loads. I'm a little > |> surprised by that though maybe I shouldn't be as there is a RISC > |> implemented by the microcode under the x86 CPU. > |> > |> Does anyone know if gcc has an option to ignore alignment and pack the > |> structs? > |> > | > |__attribute__ ((__packed__)) > > __attribute__((packed, aligned(1))) > > I have forgotten why both. > packed (or __packed__) says no space between items. aligned(1) means that the structure can start at any address. Warner -------------- next part -------------- An HTML attachment was scrubbed... URL: From usotsuki at buric.co Tue Mar 4 06:20:40 2025 From: usotsuki at buric.co (Steve Nickolas) Date: Mon, 3 Mar 2025 15:20:40 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250303182757.GR13234@mcvoy.com> References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: On Mon, 3 Mar 2025, Larry McVoy wrote: > Does anyone know if gcc has an option to ignore alignment and pack the > structs? Isn't it __attribute__((packed)) or something like that? -uso. From crossd at gmail.com Tue Mar 4 06:44:20 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 3 Mar 2025 15:44:20 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: On Mon, Mar 3, 2025 at 3:26 PM Steve Nickolas wrote: > > On Mon, 3 Mar 2025, Larry McVoy wrote: > > > Does anyone know if gcc has an option to ignore alignment and pack the > > structs? > > Isn't it __attribute__((packed)) or something like that? There are a few different ways to do that; some compilers have `#pragma pack(1)` to pack structs on byte boundaries. All are language extensions. Standard C, starting with (IIRC) C11, has the `_Alignas` keyword that lets you specify extended alignment for data; this can be useful in a struct or something. However, I just checked the latest C spec and it's quite explicit that you can't use it to relax alignment requirements to something less strict. That is, you can't portably say `alignas(1)` for an `int`. What Paul described from BLISS makes it sound like you could describe things like an IPv4 packet (which has fields that start at odd bit boundaries) exactly. That's neat. However, usually when programming at _that_ low of a level in C or Rust or something like that, you have an ABI you can rely on for guarantees about the size and alignment of fundamental types. If I'm porting to another machine, I have to be sure those assumptions hold on the target, or update as needed. I guess that's a bummer. Unix got away with being able to specify things the way it did for so long because they controlled the horizontal and the vertical: the compiler was symbiotic with the system. You knew that your C struct looked like the layout of the hardware registers for some device because you maintained the compiler in the same source tree. Those days are long gone, of course, hence much of the current consternation around portable C. - Dan C. From jnc at mercury.lcs.mit.edu Tue Mar 4 07:20:47 2025 From: jnc at mercury.lcs.mit.edu (Noel Chiappa) Date: Mon, 3 Mar 2025 16:20:47 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: <20250303212047.920C718C09C@mercury.lcs.mit.edu> > From: "G. Brandn Robinson" > C was a language for people who wanted to get that crap out of the way > so they could think about binary representations. Huh? Sometimes C gets in the way of doing that; see below. > From: Dan Cross > They did indicate that alignment makes sharing _binary_ data between > VAX and PDP-11 harder, but that's truerepresentation of other aspects of product > types as well. Alignment is just one aspect of low-level binary representation; there's also length (in bits), which is critically important in several problem domains; device registers have already been mentioned, but more below. > From: Peter Yardley > Problems I have with C as a systems language is there is no certainty > about representation of a structure in memory and hence when it is > written to disk. That's yet another one. The area I'm thinking of (and which I saw a lot of) is code to implement network protocols (and I'm fairly astounded that nobody else has mentioned this yet). One has to have _absolute_ control over how the bits are laid out in the packet (which of course might wind up in any one of countless other machine types) - which generally means how they are laid out in memory. The whole concept of C declarations is not rich enough to really deal with this properly. For each field in the header, one absolutely needs to be able to separately specify the syntax (e.g. size in bits) and semantics (unsigned integer, etc). And if you want the code to be portable, so that one set of sources will compile to working code on several different architctures, it gets worse. Device registers, already mentioned, often only have to run on one type of machine, but having protocol implementions run on a number of different machine types is really common. I came up with a way to do this, without _any_ #ifdefs (which I don't like, for a reason I won't get into) in most source files. Dealing with byte order issues was similarly dealt with (one can't deal with it just in types, really, without making the type specification, and the language, somewhat complicated). I know later C's got better about richer variable semantics and syntax selection than the circa 1985 ones I was working with, but I don't think it was ever made completely simple and orthogonal (e.g. 'signed/unsigned/boolean/etc char/short/long/quad/word/etc') as it should have been. Noel From woods at robohack.ca Tue Mar 4 07:57:39 2025 From: woods at robohack.ca (Greg A. Woods) Date: Mon, 03 Mar 2025 13:57:39 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <5574A68C-069D-4281-B66B-80541AF86B9C@iitbombay.org> <20250302021657.GD13234@mcvoy.com> <77C32157-6FC7-49D2-97B1-1B1A9E1E60C2@iitbombay.org> <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> Message-ID: At Mon, 3 Mar 2025 22:31:13 +1100, Peter Yardley wrote: Subject: [TUHS] Re: Any UNIX With No C In Userland? > > Problems I have with C as a systems language is there is no > certainty about representation of a structure in memory and hence > when it is written to disk. That might be true for some nit-pick-ity "Standard C", but it isn't true for most real implementations, at least in the realm of 'hosted' implementations. Real implementations do after all have to interface with real hardware. In fact it's not impossible, or even really hard, to define a struct, even with bitfields, that can be "portably" compiled for a number of different architectures and platforms, including with different word sizes and endianness and which will map directly onto an external binary data representation. It can get a bit ugly to look at, but it works. Between the compiler manual and the ABI manual it _should_ all be quite clear how to do it. Of course "Standard C" has only got worse as it has morphed along the way, growing increasingly thorny with more instead of less "Undefined Behaviour". -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From paul.winalski at gmail.com Tue Mar 4 08:00:18 2025 From: paul.winalski at gmail.com (Paul Winalski) Date: Mon, 3 Mar 2025 17:00:18 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250303182757.GR13234@mcvoy.com> References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: On Mon, Mar 3, 2025 at 1:28 PM Larry McVoy wrote: > > Even x86, it would appear, wants to do aligned loads. I'm a little > surprised by that though maybe I shouldn't be as there is a RISC > implemented by the microcode under the x86 CPU. > > I don't think there's ever been a CPU design where unaligned accesses run at the same speed as naturally aligned accesses. The worst case is an unaligned access that straddles a machine-word boundary. -Paul W. -------------- next part -------------- An HTML attachment was scrubbed... URL: From gingell at computer.org Tue Mar 4 08:51:06 2025 From: gingell at computer.org (Rob Gingell) Date: Mon, 3 Mar 2025 14:51:06 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> On 3/1/25 3:28 PM, Clem Cole wrote: > Funny thing, I > have always said huge reason BLISS lost was that DEC charged for $5000 > per CPU for it on TOPS or VMS, while  C was free with UNIX - even though > the difference is the resulting code was remarkable.   So many people > stayed away because they did not want to spend the extra $s. The price certainly was an impediment to adoption, and arguably so even within DEC. At CWRU we did a lot of things in Common BLISS (-36, -16) including changes to the TOPS-20 monitor. Some of those were bought back by DEC but an impediment to that was they didn't want any of the TOPS-20 source to be in BLISS. The reason given was that source customers didn't want the burden of purchasing the license but it was hard to tell whether that was the real reason vs. a cultural rejection within DEC. I forget how this was resolved but think it was a "take the generated code and add comments" solution -- that BLISS' code generation was pretty awesome made that workable if tedious. Common BLISS was a pretty flexible and rich development tool, it would have been interesting to see it gain more traction though I can't see it "winning" vs. C given how much traction C had. The earlier BLISS-10/-11 dialects were less compelling and quirkier. The UNIX-related part of this was that among the things we used BLISS for was a package called PAUNIX that attempted to implement the V7 UNIX interfaces using the mechanisms/form used to implement TOPS-10 compatibility (PA1050). Like the TOPS-10 compatibility it was a run-time veneer over TOPS-20 that mapped the UNIX functionality onto it and buffered the differences. I describe it as "attempted to" as it was successful as far as it got but it was a hobby project that got to about 80% done (meaning about 80% of the total effort remained) before life and other assignments preempted it. (Although, with my PiDP-10 running TOPS-20, a fun exercise in nostalgic futility might be to haul PAUNIX out of the closet and try to finish it.) From crossd at gmail.com Tue Mar 4 09:13:27 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 3 Mar 2025 18:13:27 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> Message-ID: On Mon, Mar 3, 2025 at 6:01 PM Rob Gingell wrote: > [snip] > The UNIX-related part of this was that among the things we used BLISS > for was a package called PAUNIX that attempted to implement the V7 UNIX > interfaces using the mechanisms/form used to implement TOPS-10 > compatibility (PA1050). Like the TOPS-10 compatibility it was a run-time > veneer over TOPS-20 that mapped the UNIX functionality onto it and > buffered the differences. I describe it as "attempted to" as it was > successful as far as it got but it was a hobby project that got to about > 80% done (meaning about 80% of the total effort remained) before life > and other assignments preempted it. > > (Although, with my PiDP-10 running TOPS-20, a fun exercise in nostalgic > futility might be to haul PAUNIX out of the closet and try to finish it.) As an aside, Ken Harrenstein wrote a C compiler for the PDP-10 and a support library that was "good enough" to port a number of GNU tools to TOPS-20, including bash: @i vers Indra, PANDA TOPS-20 Monitor 7.1(21733)-4 PANDA TOPS-20 Command processor 7.1(4453)-4 @sh bash$ echo $BASH_VERSION 1.12.1 bash$ bash$ exit exit @@ @ Kind of impressive, really. - Dan C. From gingell at computer.org Tue Mar 4 10:02:45 2025 From: gingell at computer.org (Rob Gingell) Date: Mon, 3 Mar 2025 16:02:45 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> Message-ID: <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> On 3/3/25 3:13 PM, Dan Cross wrote: > As an aside, Ken Harrenstein wrote a C compiler for the PDP-10 and a > support library that was "good enough" to port a number of GNU tools > to TOPS-20, including bash: Yes, the Harrenstein C was the one we'd expected to use and were playing with at the time. We'd tried the Snyder compiler and one other I don't recall right now as well. Glad to see it developed enough to get bash compiled, very cool, thanks for pointing it out! From luther.johnson at makerlisp.com Tue Mar 4 12:10:56 2025 From: luther.johnson at makerlisp.com (Luther Johnson) Date: Mon, 3 Mar 2025 19:10:56 -0700 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250302031646.GF13234@mcvoy.com> <41D3852D-865E-4DEF-B28A-D82A53D9FEC1@iitbombay.org> <20250302183905.ibxuwqyvj4pnrihy@illithid> <3034F84A-B1DA-4F59-A8BD-06749D20A959@serissa.com> <20250303182757.GR13234@mcvoy.com> Message-ID: Any CPU with only a byte-wide memory interface (e.g. 8088) doesn't care about bigger word alignments. But yes, if the CPU can do 16 and 32 and 64 bits at a time, then aligned accesses of bigger words can be one transaction instead of two. On 03/03/2025 03:00 PM, Paul Winalski wrote: > On Mon, Mar 3, 2025 at 1:28 PM Larry McVoy > wrote: > > > Even x86, it would appear, wants to do aligned loads. I'm a little > surprised by that though maybe I shouldn't be as there is a RISC > implemented by the microcode under the x86 CPU. > > I don't think there's ever been a CPU design where unaligned accesses > run at the same speed as naturally aligned accesses. The worst case > is an unaligned access that straddles a machine-word boundary. > > -Paul W. -------------- next part -------------- An HTML attachment was scrubbed... URL: From lars at nocrew.org Tue Mar 4 16:37:56 2025 From: lars at nocrew.org (Lars Brinkhoff) Date: Tue, 04 Mar 2025 06:37:56 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250303212047.920C718C09C@mercury.lcs.mit.edu> (Noel Chiappa's message of "Mon, 3 Mar 2025 16:20:47 -0500 (EST)") References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> Message-ID: <7w8qpljo6z.fsf@junk.nocrew.org> Noel Chiappa writes: > The area I'm thinking of (and which I saw a lot of) is code to implement > network protocols (and I'm fairly astounded that nobody else has mentioned > this yet). Network packets, tape records, disk images. It's a bit off-topic, so I have tried to stay silent. > One has to have _absolute_ control over how the bits are laid out > in the packet (which of course might wind up in any one of countless other > machine types) - which generally means how they are laid out in memory. > The whole concept of C declarations is not rich enough to really deal with > this properly. I NEVER EVER use C structs or bit fields for this. I get the raw octets (or whatever the byte size is), and shift and mask accordingly to get the data fields. From lars at nocrew.org Tue Mar 4 16:40:25 2025 From: lars at nocrew.org (Lars Brinkhoff) Date: Tue, 04 Mar 2025 06:40:25 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> (Rob Gingell's message of "Mon, 3 Mar 2025 14:51:06 -0800") References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> Message-ID: <7w4j09jo2u.fsf@junk.nocrew.org> > Although, with my PiDP-10 running TOPS-20, a fun exercise in nostalgic > futility might be to haul PAUNIX out of the closet and try to finish > it. Please! At the very least, can you put the code online? From lars at nocrew.org Tue Mar 4 16:46:13 2025 From: lars at nocrew.org (Lars Brinkhoff) Date: Tue, 04 Mar 2025 06:46:13 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> (Rob Gingell's message of "Mon, 3 Mar 2025 16:02:45 -0800") References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> Message-ID: <7wzfi1i98q.fsf@junk.nocrew.org> Rob Gingell wrote: > Dan Cross wrote: >> As an aside, Ken Harrenstein wrote a C compiler for the PDP-10 KCC, started by Kok Chen at the Stanford AI lab. > Yes, the Harrenstein C was the one we'd expected to use and were > playing with at the time. We'd tried the Snyder compiler and one other > I don't recall right now as well. Probably Jay Lepreau's PCC from Utah, or maybe Sargasso C. From tuhs at tuhs.org Wed Mar 5 01:07:04 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Tue, 4 Mar 2025 10:07:04 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> Message-ID: On 3/3/25 7:02 PM, Rob Gingell wrote: > Glad to see it developed enough to get bash compiled, very cool, thanks for > pointing it out! We expected a lot less from the system and compiler in the bash-1.12 days; that helped here. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From woods at robohack.ca Wed Mar 5 07:27:09 2025 From: woods at robohack.ca (Greg A. Woods) Date: Tue, 04 Mar 2025 13:27:09 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <7w8qpljo6z.fsf@junk.nocrew.org> References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> Message-ID: At Tue, 04 Mar 2025 06:37:56 +0000, Lars Brinkhoff wrote: Subject: [TUHS] Re: Any UNIX With No C In Userland? > > I NEVER EVER use C structs or bit fields for this. I get the raw octets > (or whatever the byte size is), and shift and mask accordingly to get > the data fields. Marshalling data in or out byte by byte is of course safe and is definitely the most portable way, and one can of course still read or write "record"-length amounts of bytes at one time (or multiples of records). It just can be a little more tedious. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From gingell at computer.org Wed Mar 5 07:55:26 2025 From: gingell at computer.org (Rob Gingell) Date: Tue, 4 Mar 2025 13:55:26 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <7wzfi1i98q.fsf@junk.nocrew.org> References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> <7wzfi1i98q.fsf@junk.nocrew.org> Message-ID: <96a12f63-f74c-4248-8a31-ff8dc8fa77e9@computer.org> On 3/3/25 10:46 PM, Lars Brinkhoff wrote: > Rob Gingell wrote: >> Yes, the Harrenstein C was the one we'd expected to use and were >> playing with at the time. We'd tried the Snyder compiler and one other >> I don't recall right now as well. > > Probably Jay Lepreau's PCC from Utah, or maybe Sargasso C. Yes, Jay's was I was thinking of. I'm embarrassed to have spaced on that, Jay was such a presence in both the TOPS-20 and UNIX worlds, thanks for reminding me. On 3/4/25 7:07 AM, Chet Ramey via TUHS wrote: > We expected a lot less from the system and compiler in the bash-1.12 days; > that helped here. How would those expectations differ from what V7 provided, as PAUNIX's ambitions only went that far (not that they couldn't be expanded but...) The thing is the Harrenstein C must've gotten more complete runtime support to run 1.12 and seems like that'd exceed V7 and make PAUNIX even more futile. On 3/3/25 10:40 PM, Lars Brinkhoff wrote: > Please! At the very least, can you put the code online? It seems likely to be just a novelty and of no real use (and given the above, not necessary) but putting it online would be fine. The code is copyrighted/licensed by CWRU (and in some cases, both CWRU and me) and that should get cleaned up. At the time this work was done in the early 1980's, we had what was really a naive and clumsy view of how to go about it all. By today's expectations it looks more restrictive than anyone intended. I'll go get that cleaned up and follow-up with you (Lars) about going forward. From robpike at gmail.com Wed Mar 5 09:54:01 2025 From: robpike at gmail.com (Rob Pike) Date: Wed, 5 Mar 2025 10:54:01 +1100 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> Message-ID: Not if you do it well. Plan 9 did it and it was easy. The notion that the struct layout must correspond to the hardware device's layout is about as non-portable as thinking can get. -rob On Wed, Mar 5, 2025 at 8:27 AM Greg A. Woods wrote: > At Tue, 04 Mar 2025 06:37:56 +0000, Lars Brinkhoff > wrote: > Subject: [TUHS] Re: Any UNIX With No C In Userland? > > > > I NEVER EVER use C structs or bit fields for this. I get the raw octets > > (or whatever the byte size is), and shift and mask accordingly to get > > the data fields. > > Marshalling data in or out byte by byte is of course safe and is > definitely the most portable way, and one can of course still read or > write "record"-length amounts of bytes at one time (or multiples of > records). It just can be a little more tedious. > > -- > Greg A. Woods > > Kelowna, BC +1 250 762-7675 RoboHack > Planix, Inc. Avoncote Farms > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lm at mcvoy.com Wed Mar 5 13:15:42 2025 From: lm at mcvoy.com (Larry McVoy) Date: Tue, 4 Mar 2025 19:15:42 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> Message-ID: <20250305031542.GC24601@mcvoy.com> On Wed, Mar 05, 2025 at 10:54:01AM +1100, Rob Pike wrote: > The notion that the struct layout must correspond to the hardware device's > layout is about as non-portable as thinking can get. I agree but as a side note, I ported Lachman's (nee Convergent I believe) TCP/IP stack to ETA-10's Sys V OS and to SCO's v7 or whatever they had. Somewhere I have a note book where I laid out in structs every packet format with notes about what that packet did, I was learning. Not once did I think about packing, the structs somehow just worked on the machines I was working on. Maybe the TCP/IP guys knew about spacing in the structs. I get it, I was naive, but sometimes the structs work. From woods at robohack.ca Wed Mar 5 13:52:41 2025 From: woods at robohack.ca (Greg A. Woods) Date: Tue, 04 Mar 2025 19:52:41 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> Message-ID: At Wed, 5 Mar 2025 10:54:01 +1100, Rob Pike wrote: Subject: [TUHS] Re: Any UNIX With No C In Userland? > > The notion that the struct layout must correspond to the hardware device's > layout Not "must" -- "can". Of course that's just effectively using the compiler to do the marshalling -- at compile time. There are caveats of course -- one can lay out bit flags for a register in a struct, but one must not make a pointer to such a struct and point it at the register and expect bit manipulations to work -- even with -O0 you'll probably mess up any attempts to change bits predictably in that register in ways that the hardware expects. Of course the same problems occur if you try to point a word-sized integer, even one with a "volatile" qualifier, at a register and manipulate its bits with C bitwise expressions. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From cmhanson at eschatologist.net Wed Mar 5 16:01:53 2025 From: cmhanson at eschatologist.net (Chris Hanson) Date: Tue, 4 Mar 2025 22:01:53 -0800 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: <8549373B-99FC-4896-B369-1901690CD979@eschatologist.net> On Mar 1, 2025, at 1:51 PM, Lyndon Nerenberg (VE7TFX/VE6BBM) wrote: > > segaloco via TUHS writes: >> Given that anything that obeys the ABI and has assembler entries to the ker= >> nel >> can request services, it seems to me it would be possible to stand up a >> user-land without C being present. Have any UNIXen ever done this after th= >> e >> advent of C? > > SunOS 4.0 or 4.1 was when the Sun geniouses unbundled the C compiler > and made it a $$$ add on. That move single-handedly made GCC the > reference compiler moving forward. SunOS still included their K&R compiler for kernel tuning, as well as all of the header files. It was both sufficient for hacking and sufficient for bootstrapping GCC if you needed an ANSI compiler. -- Chris From jnc at mercury.lcs.mit.edu Wed Mar 5 16:23:20 2025 From: jnc at mercury.lcs.mit.edu (Noel Chiappa) Date: Wed, 5 Mar 2025 01:23:20 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> > From: Larry McVoy > Not once did I think about packing, the structs somehow just worked on > the machines I was working on. Maybe the TCP/IP guys knew about spacing > in the structs. Not really! Of the first 6 TCP/IP implementations: https://gunkies.org/wiki/TCP_and_IP_bake_offs only 1 was in C - and it was a relatively late one. The earliest ones were mostly in assembler (PDP-10 and PDP-11). Noel From tuhs at tuhs.org Wed Mar 5 18:38:14 2025 From: tuhs at tuhs.org (Konstantin Belousov via TUHS) Date: Wed, 5 Mar 2025 10:38:14 +0200 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250305031542.GC24601@mcvoy.com> References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> <20250305031542.GC24601@mcvoy.com> Message-ID: Sorry for dup, resent from the subscribed address. On Tue, Mar 04, 2025 at 07:15:42PM -0800, Larry McVoy wrote: > On Wed, Mar 05, 2025 at 10:54:01AM +1100, Rob Pike wrote: > > The notion that the struct layout must correspond to the hardware device's > > layout is about as non-portable as thinking can get. > > I agree but as a side note, I ported Lachman's (nee Convergent I believe) > TCP/IP stack to ETA-10's Sys V OS and to SCO's v7 or whatever they had. > Somewhere I have a note book where I laid out in structs every packet > format with notes about what that packet did, I was learning. > > Not once did I think about packing, the structs somehow just worked on > the machines I was working on. Maybe the TCP/IP guys knew about spacing > in the structs. > > I get it, I was naive, but sometimes the structs work. All questions about type sizes and alignment, as well as the layout of structures, are defined by platform ABI. Any useful compiler for C would follow ABI, and any non-C compiler hoping to interoperate with the system libraries, would provide some way to communicate using the ABI. Then, after some time, but long time ago, all Unix-like systems converged to just two practically observable ABIs: ILP32 and LP64. Both of them require 8bit bytes, byte addressability, natural alignment for the basic integer types, and equiality of uint8_t/unsigned char, uint16_t/short, uint32_t/long. >From there, it is easy to see that e.g. layout of most ip protocol packets can be trivially written as C structs. The only trouble there is the need to align whole incoming packets on suitable boundary, but e.g. for IP/TCP/UDP the 4-bytes is enough, and most hardware enforces it. From tuhs at tuhs.org Thu Mar 6 00:15:22 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Wed, 5 Mar 2025 09:15:22 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <96a12f63-f74c-4248-8a31-ff8dc8fa77e9@computer.org> References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> <7wzfi1i98q.fsf@junk.nocrew.org> <96a12f63-f74c-4248-8a31-ff8dc8fa77e9@computer.org> Message-ID: <58579e53-bb0f-4f0a-b9cb-67d22c711494@case.edu> On 3/4/25 4:55 PM, Rob Gingell wrote: > On 3/4/25 7:07 AM, Chet Ramey via TUHS wrote: >> We expected a lot less from the system and compiler in the bash-1.12 days; >> that helped here. > > How would those expectations differ from what V7 provided, as PAUNIX's > ambitions only went that far (not that they couldn't be expanded but...) > The thing is the Harrenstein C must've gotten more complete runtime support > to run 1.12 and seems like that'd exceed V7 and make PAUNIX even more futile. This would have been 1991-1992, so we were working on mostly 4.3 BSD (me) and SunOS (Brian). We didn't expect more than K&R from the compiler, even though we were primarily using gcc. It was before autoconf, so we rolled our own version to create a `sysdefs.h', and had code to choose between POSIX, USG, and BSD versions of functions. I don't think it would have compiled on anything older than possibly 4.2 BSD, and probably not that, so I think your speculation about the post-V7 runtime support is on the mark. Chet -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From henry.r.bent at gmail.com Thu Mar 6 00:26:14 2025 From: henry.r.bent at gmail.com (Henry Bent) Date: Wed, 5 Mar 2025 09:26:14 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <58579e53-bb0f-4f0a-b9cb-67d22c711494@case.edu> References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> <7wzfi1i98q.fsf@junk.nocrew.org> <96a12f63-f74c-4248-8a31-ff8dc8fa77e9@computer.org> <58579e53-bb0f-4f0a-b9cb-67d22c711494@case.edu> Message-ID: On Wed, 5 Mar 2025 at 09:15, Chet Ramey via TUHS wrote: > On 3/4/25 4:55 PM, Rob Gingell wrote: > > > On 3/4/25 7:07 AM, Chet Ramey via TUHS wrote: > >> We expected a lot less from the system and compiler in the bash-1.12 > days; > >> that helped here. > > > > How would those expectations differ from what V7 provided, as PAUNIX's > > ambitions only went that far (not that they couldn't be expanded but...) > > The thing is the Harrenstein C must've gotten more complete runtime > support > > to run 1.12 and seems like that'd exceed V7 and make PAUNIX even more > futile. > > This would have been 1991-1992, so we were working on mostly 4.3 BSD (me) > and SunOS (Brian). We didn't expect more than K&R from the compiler, even > though we were primarily using gcc. It was before autoconf, so we rolled > our own version to create a `sysdefs.h', and had code to choose between > POSIX, USG, and BSD versions of functions. I don't think it would have > compiled on anything older than possibly 4.2 BSD, and probably not that, > so I think your speculation about the post-V7 runtime support is on the > mark. I have bash-2.04 running on 4.1C BSD. I don't remember whether I used gcc-1.42 or gcc-2.81 to build it, but it was one of the two. In my experience 2.04 is the last version that will successfully build and run on many very older systems, though I believe there are a few where I was not able to get past 1.14. At least on 4.1C, I've successfully used 2.04 to run all sorts of configure scripts and I don't remember ever having run into significant issues. -Henry -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Thu Mar 6 00:29:46 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Wed, 5 Mar 2025 09:29:46 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: <6dc49d15-e771-444f-aa89-a7b1d07228b1@computer.org> <270ed8eb-8400-4e37-89a5-e5e7e3b5aa2d@computer.org> <7wzfi1i98q.fsf@junk.nocrew.org> <96a12f63-f74c-4248-8a31-ff8dc8fa77e9@computer.org> <58579e53-bb0f-4f0a-b9cb-67d22c711494@case.edu> Message-ID: <200122be-5db7-4b24-a19b-1f4e7a5e7b22@case.edu> On 3/5/25 9:26 AM, Henry Bent wrote: > This would have been 1991-1992, so we were working on mostly 4.3 BSD (me) > and SunOS (Brian). We didn't expect more than K&R from the compiler, even > though we were primarily using gcc. It was before autoconf, so we rolled > our own version to create a `sysdefs.h', and had code to choose between > POSIX, USG, and BSD versions of functions. I don't think it would have > compiled on anything older than possibly 4.2 BSD, and probably not that, > so I think your speculation about the post-V7 runtime support is on the > mark. > > > I have bash-2.04 running on 4.1C BSD. Nice! I haven't done much software archaeology like that. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From emu at e-bbes.com Thu Mar 6 00:50:07 2025 From: emu at e-bbes.com (emanuel stiebler) Date: Wed, 5 Mar 2025 09:50:07 -0500 Subject: [TUHS] DCJ-11 processor with 20k FPGA In-Reply-To: References: Message-ID: <3da18b77-d02d-4916-b272-2d2c925eec2c@e-bbes.com> On 2025-03-01 07:11, Jason Stevens via TUHS wrote: > I assume people have seen this? > > https://github.com/ryomuk/TangNanoDCJ11MEM/tree/main > > > It's capable of running Unix v1 & some limited amount of v6 among other > things. The FPGA in question the Tang Nano 20k is sub 30GBP delivered from > AliExpress. > > Kind of neat to combine a real processor with a simple FPGA implementation > of the hardware. I just had a look at it, but he doesn't show the code, which runs on the TangNano? From chopps at chopps.org Thu Mar 6 01:30:54 2025 From: chopps at chopps.org (Christian Hopps) Date: Wed, 05 Mar 2025 10:30:54 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: (Konstantin Belousov via TUHS's message of "Wed, 5 Mar 2025 10:38:14 +0200") References: <20250303212047.920C718C09C@mercury.lcs.mit.edu> <7w8qpljo6z.fsf@junk.nocrew.org> <20250305031542.GC24601@mcvoy.com> Message-ID: Konstantin Belousov via TUHS writes: > ... > Then, after some time, but long time ago, all Unix-like systems converged > to just two practically observable ABIs: ILP32 and LP64. Both of them > require 8bit bytes, byte addressability, natural alignment for the basic > integer types, and equiality of uint8_t/unsigned char, uint16_t/short, > uint32_t/long. I think you mean uint32_t/int; `long` is 32 bits in IPL32 and 64 bits in LP64, indeed the name LP64 stands for Long and Pointer 64 :) Thanks, Chris. From jnc at mercury.lcs.mit.edu Thu Mar 6 02:14:49 2025 From: jnc at mercury.lcs.mit.edu (Noel Chiappa) Date: Wed, 5 Mar 2025 11:14:49 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: <20250305161449.2BD1B18C0A6@mercury.lcs.mit.edu> > From: Rob Pike > The notion that the struct layout must correspond to the hardware > device's layout is about as non-portable as thinking can get. I'm confused; I thought device register layout is inherently about as non-portable a thing as one could have, generally. (Exceptions: 1) the device is basically a single chip, so interfaces on two machines might be essentially identical, if they use the same chip; 2) someone made a 68K card that plugged into a QBUS, so drivers on a PDP-11 and that 68K could be identical.) Or did you mean that one could somehow disassociate the struct layout and the details of the device (assuming it has addressable registers, as became common)? How (I'm missing it)? Noel From tuhs at tuhs.org Thu Mar 6 04:54:52 2025 From: tuhs at tuhs.org (Jason Stevens via TUHS) Date: Wed, 5 Mar 2025 18:54:52 -0000 Subject: [TUHS] DCJ-11 processor with 20k FPGA Message-ID: the code in the repo is for the FPGA, the processor that is strapped to the FPGA well it runs the real code. It's like the 'minimig' Amiga emulator platform, a real processor, and FPGA to do all the IO heavy lifting. So it's not 100% FPGA but you are executing code on a real processor so you aren't exactly full emulation either. And it doesn't cost a fortune, assuming you can find one of these ancient microprocessors. -----Original Message----- From: emanuel stiebler To: Jason Stevens; 'tuhs at tuhs.org' Sent: 3/5/25 2:50 PM Subject: Re: [TUHS] DCJ-11 processor with 20k FPGA On 2025-03-01 07:11, Jason Stevens via TUHS wrote: > I assume people have seen this? > > https://github.com/ryomuk/TangNanoDCJ11MEM/tree/main > > > It's capable of running Unix v1 & some limited amount of v6 among other > things. The FPGA in question the Tang Nano 20k is sub 30GBP delivered from > AliExpress. > > Kind of neat to combine a real processor with a simple FPGA implementation > of the hardware. I just had a look at it, but he doesn't show the code, which runs on the TangNano? From tuhs at tuhs.org Thu Mar 6 05:45:26 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Wed, 05 Mar 2025 19:45:26 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250305161449.2BD1B18C0A6@mercury.lcs.mit.edu> References: <20250305161449.2BD1B18C0A6@mercury.lcs.mit.edu> Message-ID: On Wednesday, March 5th, 2025 at 8:14 AM, jnc at mercury.lcs.mit.edu wrote: > > From: Rob Pike > > > > The notion that the struct layout must correspond to the hardware > > > device's layout is about as non-portable as thinking can get. > > > I'm confused; I thought device register layout is inherently about as > non-portable a thing as one could have, generally. > > (Exceptions: 1) the device is basically a single chip, so interfaces on two > machines might be essentially identical, if they use the same chip; 2) someone > made a 68K card that plugged into a QBUS, so drivers on a PDP-11 and that 68K > could be identical.) > > Or did you mean that one could somehow disassociate the struct layout and the > details of the device (assuming it has addressable registers, as became > common)? How (I'm missing it)? > > Noel I got thinking about this more after seeing Rob's reply, it does make sense the more I ponder on it now. While it'd be nice to hand that info to the language and have compilation handle all the details, given that endianness is already an issue, it's a steep order to expect any of the other details of mapping onto a set of registers to actually be easily facilitated. Basically unless the struct can perfectly capture *everything*, in other words, operand size, bit ordering, volatility, padding, and so on, the fact that any assumptions may be made at all precludes trusting a struct to be the means for describing a hardware interface. As much as it may pain one (such as myself...) to put in all the serialization logic for data, expecting any old C compiler to be built with that in mind when it isn't a requirement for the language may be a fool's errand. It is nice when it works out that way on a specific machine and your code need not be portable though. Basically it's a happy accident when it does work, but not an indictment of the language when it doesn't. Plus while C may have grown up with the PDP-11 UNIX kernel, it grew wings and soared in userland applications, of which interacting with hardware I/O registers isn't really an expected application. IBM tried the divorce of user and system code with the PL/I vs PL/S exercise, and one only need take a look around to see what impact that materially had on the longevity of the languages. This sort of discussion really makes me appreciate this list, these are perspectives I don't often find elsewhere outside of footnotes in places I wouldn't know to search for. - Matt G. From phil at ultimate.com Thu Mar 6 06:39:14 2025 From: phil at ultimate.com (Phil Budne) Date: Wed, 05 Mar 2025 15:39:14 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> Message-ID: <202503052039.525KdEoB093322@ultimate.com> Noel Chiappa wrote: > > From: Larry McVoy > > > Not once did I think about packing, the structs somehow just worked on > > the machines I was working on. Maybe the TCP/IP guys knew about spacing > > in the structs. > > Not really! Of the first 6 TCP/IP implementations: > only 1 was in C - and it was a relatively late one. The earliest ones were > mostly in assembler (PDP-10 and PDP-11). BUT, the basic TCP and IP protocols seem to have been created with a general care that two byte fields should be aligned at multiples of two bytes and four byte fields should be at multiples of four bytes, or at the VERY least, no multi-byte field should cross a four byte boundary (which would have been a pain to PDP-10 programmers who likely expected 32 bits of data in each 36 bit word). IP and TCP Options may blow that out of the water, I don't recall. ISTR that in the IL protocol (atop IP) in early Plan9, the four byte sequence numbers would cross over the lines in a traditional RFC box diagram. And for early TCP/IP networking and alignment pain (with Unibus!): I briefly looked at gluing the TCP/IP stack in MIT-ITS for the 2020 (KS10, a PDP-10) which had a Unibus, to the driver someone had written a Unibus Interlan NI1010 card, BUT the 14 byte Ethernet header meant the 32-bit words in the TCP and IP headers landed across the (36 bit) word boundaries expected by ITS' TCP stack (that had been written to speak to IMPs). There might have been an (MIT added?) BLT to move 8-bit bytes around (or the DEC EXTEND MOVSLJ might have been available), but it was too gross to contemplate... I suppose Berkeley trailers might have solved the problem (had I known about them at the time, but.... also gross). And as for C: It was born on a 16-bit word system that expected 16-bit aligned words, replacing B, a language with just the word datatype, on a system with a 32KB (or less) user space, by people who had just come from a project where waiting for working production PL/I compiler had been a major headache, and generally suffered from "second system syndrome" bloat. I think it's fair to say that the essence of UNIX and the brilliance of the development team (largely lost on modern developers) was to make it as simple as possible, but no simpler. IMO judging Unix, C and it's creators by criteria like "how could they not have anticipated my alignment problems" (or why isn't C like PL/I) is armchair quarterbacking. If you feel otherwise, maybe you should travel back in time and join the Unix-Haters mailing list, or at least not be on this one, insulting people who were involved, and are kind enough to put up with our inane blathering... From johnl at taugh.com Thu Mar 6 14:03:44 2025 From: johnl at taugh.com (John Levine) Date: 5 Mar 2025 23:03:44 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <202503052039.525KdEoB093322@ultimate.com> References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> <202503052039.525KdEoB093322@ultimate.com> Message-ID: <20250306040344.BE5FFBED180B@ary.qy> It appears that Phil Budne said: >Noel Chiappa wrote: >> > From: Larry McVoy >> >> > Not once did I think about packing, the structs somehow just worked on >> > the machines I was working on. Maybe the TCP/IP guys knew about spacing >> > in the structs. >> >> Not really! Of the first 6 TCP/IP implementations: >> only 1 was in C - and it was a relatively late one. The earliest ones were >> mostly in assembler (PDP-10 and PDP-11). > >BUT, the basic TCP and IP protocols seem to have been created with a >general care that two byte fields should be aligned at multiples of >two bytes and four byte fields should be at multiples of four bytes, >or at the VERY least, no multi-byte field should cross a four byte >boundary (which would have been a pain to PDP-10 programmers who >likely expected 32 bits of data in each 36 bit word). There were a bunch of S/360 systems, which also need two- and four-byte fiekds to be aligned unless you do extra byte string moves to pack and unpack the structures. >And as for C: It was born on a 16-bit word system that expected 16-bit >aligned words, replacing B, a language with just the word datatype, on >a system with a 32KB (or less) user space, by people who had just come >from a project where waiting for working production PL/I compiler had >been a major headache, and generally suffered from "second system >syndrome" bloat. I believe the earliest versions of C were on a GE 635, a word addressed machine comparable to a PDP-10. But it moved to a PDP-11 soon enough where the byte and word addresses motivated the datatypes that turned B into C. R's, John From crossd at gmail.com Thu Mar 6 14:35:07 2025 From: crossd at gmail.com (Dan Cross) Date: Wed, 5 Mar 2025 23:35:07 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250306040344.BE5FFBED180B@ary.qy> References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> <202503052039.525KdEoB093322@ultimate.com> <20250306040344.BE5FFBED180B@ary.qy> Message-ID: On Wed, Mar 5, 2025 at 11:11 PM John Levine wrote: >>snip > >And as for C: It was born on a 16-bit word system that expected 16-bit > >aligned words, replacing B, a language with just the word datatype, on > >a system with a 32KB (or less) user space, by people who had just come > >from a project where waiting for working production PL/I compiler had > >been a major headache, and generally suffered from "second system > >syndrome" bloat. > > I believe the earliest versions of C were on a GE 635, a word addressed > machine comparable to a PDP-10. But it moved to a PDP-11 soon enough > where the byte and word addresses motivated the datatypes that turned > B into C. This is a startling claim that I have never heard before. Can you cite a source for this? The history as recounted in Dennis Ritchie's paper on the history of C (https://www.nokia.com/bell-labs/about/dennis-m-ritchie/chist.pdf) is that the initial bootstrapping of PDP-7 Unix was done by cross-assembling on a GE-635, with paper-tape hand carried to the PDP. B appears to have been an invention that began life on the PDP-7 and was subsequently ported to the PDP-11. But C did not appear until they were solidly on the PDP-11, and was developed on that machine. The earliest connections to other machines were Dennis Ritchie constructing a cross-compiler for B that ran on the PDP-7; as he described it: |The most ambitious enterprise I undertook was a genuine cross-compiler |that translated B to GE-635 machine instructions, not threaded code. It was |a small tour de force: a full B compiler, written in its own language and |generating code for a 36-bit mainframe, that ran on an 18-bit machine with |4K words of user address space. C was later ported to GE-635 and IBM 370, but there's no indication it was created on the GE-635. - Dan C From johnl at taugh.com Thu Mar 6 15:28:42 2025 From: johnl at taugh.com (John Levine) Date: 6 Mar 2025 00:28:42 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: References: Message-ID: I thought Dennis said he didn’t the work on the 635 but you’re right, C was mostly on the Pdp-11 and then targeted back to the 635. Please consider the environment before reading this message. John Levine, johnl at taugh.com > On Mar 5, 2025, at 23:36, Dan Cross wrote: > > On Wed, Mar 5, 2025 at 11:11 PM John Levine wrote: >>> snip >>> And as for C: It was born on a 16-bit word system that expected 16-bit >>> aligned words, replacing B, a language with just the word datatype, on >>> a system with a 32KB (or less) user space, by people who had just come >>> from a project where waiting for working production PL/I compiler had >>> been a major headache, and generally suffered from "second system >>> syndrome" bloat. >> >> I believe the earliest versions of C were on a GE 635, a word addressed >> machine comparable to a PDP-10. But it moved to a PDP-11 soon enough >> where the byte and word addresses motivated the datatypes that turned >> B into C. > > This is a startling claim that I have never heard before. Can you cite > a source for this? > > The history as recounted in Dennis Ritchie's paper on the history of C > (https://www.nokia.com/bell-labs/about/dennis-m-ritchie/chist.pdf) is > that the initial bootstrapping of PDP-7 Unix was done by > cross-assembling on a GE-635, with paper-tape hand carried to the PDP. > B appears to have been an invention that began life on the PDP-7 and > was subsequently ported to the PDP-11. But C did not appear until they > were solidly on the PDP-11, and was developed on that machine. The > earliest connections to other machines were Dennis Ritchie > constructing a cross-compiler for B that ran on the PDP-7; as he > described it: > > |The most ambitious enterprise I undertook was a genuine cross-compiler > |that translated B to GE-635 machine instructions, not threaded code. It was > |a small tour de force: a full B compiler, written in its own language and > |generating code for a 36-bit mainframe, that ran on an 18-bit machine with > |4K words of user address space. > > C was later ported to GE-635 and IBM 370, but there's no indication > it was created on the GE-635. > > - Dan C > From lars at nocrew.org Thu Mar 6 16:23:06 2025 From: lars at nocrew.org (Lars Brinkhoff) Date: Thu, 06 Mar 2025 06:23:06 +0000 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: (Dan Cross's message of "Wed, 5 Mar 2025 23:35:07 -0500") References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> <202503052039.525KdEoB093322@ultimate.com> <20250306040344.BE5FFBED180B@ary.qy> Message-ID: <7weczaisol.fsf@junk.nocrew.org> Dan Cross writes: > John Levine wrote: >> I believe the earliest versions of C were on a GE 635, a word addressed >> machine comparable to a PDP-10. But it moved to a PDP-11 soon enough > This is a startling claim that I have never heard before. [...] C did > not appear until they were solidly on the PDP-11, and was developed on > that machine. The earliest connections to other machines were Dennis > Ritchie constructing a cross-compiler for B that ran on the PDP-7 Whilst I agree the claim is startling, there is a connection between early C and the GE 635. The little known Snyder C compiler has back ends for the PDP-10, PDP-11, and Honeywell 6000 series (nee GE 635). https://github.com/PDP-10/Snyder-C-compiler/blob/master/MAC-TR-149.pdf I believe this compiler was started when Snyder visited Bell Labs in 1973, and it comes with a very archaic version of Yacc. From emu at e-bbes.com Thu Mar 6 20:50:36 2025 From: emu at e-bbes.com (emanuel stiebler) Date: Thu, 6 Mar 2025 05:50:36 -0500 Subject: [TUHS] DCJ-11 processor with 20k FPGA In-Reply-To: References: Message-ID: <6c49ba81-94da-4207-997d-c2e3d5fe7812@e-bbes.com> On 2025-03-05 13:54, Jason Stevens wrote: > the code in the repo is for the FPGA, the processor that is strapped to the > FPGA well it runs the real code. > > It's like the 'minimig' Amiga emulator platform, a real processor, and FPGA > to do all the IO heavy lifting. > > So it's not 100% FPGA but you are executing code on a real processor so you > aren't exactly full emulation either. And it doesn't cost a fortune, > assuming you can find one of these ancient microprocessors. You are right, I just found it. Didn't expect the HDL code in the "applications" folder ... From jnc at mercury.lcs.mit.edu Thu Mar 6 21:42:18 2025 From: jnc at mercury.lcs.mit.edu (Noel Chiappa) Date: Thu, 6 Mar 2025 06:42:18 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: <20250306114218.90F8D18C086@mercury.lcs.mit.edu> > From: Phil Budne > BUT, the basic TCP and IP protocols seem to have been created with a > general care that two byte fields should be aligned at multiples of two > bytes Yes, because dealing with a 16-bit field that spans two PDP-11 16-bit words is a pain (espcially because the PDP-11 does not have a 'load byte into register _without_ extending the sign bit into the high half' instruction). Do realize that in addition to the early TCP implementation, the _first_ TCP router (at that stage, TCP and IP were not separate protocols) was also a PDP-11 (albeit programmed in BCPL, not MACRO-11). I remember the extension being a real PITA. To load an un-aligned 16-bit quantity into R0, one would have had to do something like (assuming a pointer to the un-aligned 16-bit quantity was in R1): MOVB (R1)+, R0 SWAB R0 BIC #0377, R0 BISB (R1)+, R0 There may have been a better way to do it, but that's the best I can come up with now; I recall we had to do something like that. Yes, the 16-bit fields were 16-bit word aligned. Noel From jnc at mercury.lcs.mit.edu Thu Mar 6 21:47:36 2025 From: jnc at mercury.lcs.mit.edu (Noel Chiappa) Date: Thu, 6 Mar 2025 06:47:36 -0500 (EST) Subject: [TUHS] Any UNIX With No C In Userland? Message-ID: <20250306114736.0A79418C086@mercury.lcs.mit.edu> > There may have been a better way to do it CLR R0 BISB (R1)+, R0 SWAB R0 BISB (R1)+, R0 That saved one instruction over the previous one. Noel From clemc at ccc.com Fri Mar 7 03:09:54 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 6 Mar 2025 12:09:54 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <20250306040344.BE5FFBED180B@ary.qy> References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> <202503052039.525KdEoB093322@ultimate.com> <20250306040344.BE5FFBED180B@ary.qy> Message-ID: On Wed, Mar 5, 2025 at 11:03 PM John Levine wrote: > > I believe the earliest versions of C were on a GE 635, a word addressed > machine comparable to a PDP-10. But it moved to a PDP-11 soon enough > where the byte and word addresses motivated the datatypes that turne > > I believe the history is this.... Ken write the original B interpreter/subsystem for in Assembl;er (see the Unix V0 stuff). Dennis write a new B compiler in B. That was moved the GE635 (and Steve Johnson brought it with him to Waterloo). When the PDP-11 came, Dennis started change the compiler to add features for the PDP-11. This was originally called "New B" (or nb). He used a continuous development scheme. Originally, nb could accept B, but ass Dennis added more and more features that were not compatible (basically as typing was added), nb became so different it was renamed C. Johnson's comment was that by the time he returned from Waterloo, no one was using B anymore. At some point, Synder (who was a summer student) wrote a C compiler for the PDP-10. I am unclear if that was before the Honeywell 6000 back end, although given the dates, if you look at the code repositories, I >>think<< it was. FWIW: by 1978, Dennis and Brian do not mention the PDP-10 on page 179 of K&R — only the PDP-11, Honeywell, IBM 360/370 and Interdata 8/32. ᐧ ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Fri Mar 7 05:47:43 2025 From: crossd at gmail.com (Dan Cross) Date: Thu, 6 Mar 2025 14:47:43 -0500 Subject: [TUHS] Any UNIX With No C In Userland? In-Reply-To: <7weczaisol.fsf@junk.nocrew.org> References: <20250305062320.D6F3A18C0A4@mercury.lcs.mit.edu> <202503052039.525KdEoB093322@ultimate.com> <20250306040344.BE5FFBED180B@ary.qy> <7weczaisol.fsf@junk.nocrew.org> Message-ID: On Thu, Mar 6, 2025 at 1:23 AM Lars Brinkhoff wrote: > Dan Cross writes: > > John Levine wrote: > >> I believe the earliest versions of C were on a GE 635, a word addressed > >> machine comparable to a PDP-10. But it moved to a PDP-11 soon enough > > This is a startling claim that I have never heard before. [...] C did > > not appear until they were solidly on the PDP-11, and was developed on > > that machine. The earliest connections to other machines were Dennis > > Ritchie constructing a cross-compiler for B that ran on the PDP-7 > > Whilst I agree the claim is startling, there is a connection between > early C and the GE 635. The little known Snyder C compiler has back > ends for the PDP-10, PDP-11, and Honeywell 6000 series (nee GE 635). > > https://github.com/PDP-10/Snyder-C-compiler/blob/master/MAC-TR-149.pdf > > I believe this compiler was started when Snyder visited Bell Labs in > 1973, and it comes with a very archaic version of Yacc. Interesting. I wonder if this is the basis for Dennis mentioning the GE-635 port in the C history paper? - Dan C. From rich.salz at gmail.com Sun Mar 9 01:40:24 2025 From: rich.salz at gmail.com (Rich Salz) Date: Sat, 8 Mar 2025 10:40:24 -0500 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= Message-ID: Gift article about Holmdel facility... Bell Works, the setting of the hit serial for Apple TV+, is now a tourist attraction, drawing fans to the architectural wonder. https://www.nytimes.com/2025/03/07/realestate/severance-lumon-industries-building-bell-labs.html?unlocked_article_code=1.2U4.9nGx.iMzMqZbSep6U&smid=em-share -------------- next part -------------- An HTML attachment was scrubbed... URL: From henry.r.bent at gmail.com Sun Mar 9 02:10:17 2025 From: henry.r.bent at gmail.com (Henry Bent) Date: Sat, 8 Mar 2025 11:10:17 -0500 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: On Sat, 8 Mar 2025 at 10:40, Rich Salz wrote: > Gift article about Holmdel facility... > > Bell Works, the setting of the hit serial for Apple TV+, is now a tourist > attraction, drawing fans to the architectural wonder. > > > https://www.nytimes.com/2025/03/07/realestate/severance-lumon-industries-building-bell-labs.html?unlocked_article_code=1.2U4.9nGx.iMzMqZbSep6U&smid=em-share > > Thank you for sharing this. It's so nice to know that the space has continued to be used productively. Crawford Hill, where my father worked, had a longer and more drawn out end but it looks like everything has worked out positively and it's going to be a park. -Henry -------------- next part -------------- An HTML attachment was scrubbed... URL: From robpike at gmail.com Sun Mar 9 07:08:26 2025 From: robpike at gmail.com (Rob Pike) Date: Sun, 9 Mar 2025 08:08:26 +1100 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: As someone who worked there occasionally, I can say that it's an amazing building but from a workplace perspective full of architectural ideas that simply do not work in practice. No office has a window - the outer walkways take all the windows, and all the offices face a transverse corridor. (In a later ridiculously expensive expansion for a building Saarinen said could never be expanded, they gave the president's office a window on the end. You might have seen it in the show.) Only artificial light therefore.The huge central atrium is cool but of course its floor was soon filled with cubicles, ruining the effect and eventually requiring a netting over it to stop people throwing things on the workers below. And so on. Great work happened there but the building, externally at least a straightened TJ Watson, was a failure in my amateur opinion. What a dreary workplace it was. But the approach is nice, at least until the geese took over and became serious menaces to arrivals. It's a fitting locale for the show. No use of the geese though. Maybe they've moved on. I will say that with that (I think) opening shot of series of the water tower, I shrieked. -rob On Sun, Mar 9, 2025 at 2:47 AM Rich Salz wrote: > > Gift article about Holmdel facility... > > Bell Works, the setting of the hit serial for Apple TV+, is now a tourist attraction, drawing fans to the architectural wonder. > > https://www.nytimes.com/2025/03/07/realestate/severance-lumon-industries-building-bell-labs.html?unlocked_article_code=1.2U4.9nGx.iMzMqZbSep6U&smid=em-share From tuhs at tuhs.org Sun Mar 9 07:15:26 2025 From: tuhs at tuhs.org (=?utf-8?b?UGV0ZXIgV2VpbmJlcmdlciAo5rip5Y2a5qC8KSB2aWEgVFVIUw==?=) Date: Sat, 8 Mar 2025 16:15:26 -0500 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: Rob is overly generous about the space. But great work did happen there. On Sat, Mar 8, 2025 at 4:08 PM Rob Pike wrote: > > As someone who worked there occasionally, I can say that it's an > amazing building but from a workplace perspective full of > architectural ideas that simply do not work in practice. No office has > a window - the outer walkways take all the windows, and all the > offices face a transverse corridor. (In a later ridiculously expensive > expansion for a building Saarinen said could never be expanded, they > gave the president's office a window on the end. You might have seen > it in the show.) Only artificial light therefore.The huge central > atrium is cool but of course its floor was soon filled with cubicles, > ruining the effect and eventually requiring a netting over it to stop > people throwing things on the workers below. And so on. Great work > happened there but the building, externally at least a straightened TJ > Watson, was a failure in my amateur opinion. > > What a dreary workplace it was. > > But the approach is nice, at least until the geese took over and > became serious menaces to arrivals. > > It's a fitting locale for the show. No use of the geese though. Maybe > they've moved on. > > I will say that with that (I think) opening shot of series of the > water tower, I shrieked. > > -rob > > On Sun, Mar 9, 2025 at 2:47 AM Rich Salz wrote: > > > > Gift article about Holmdel facility... > > > > Bell Works, the setting of the hit serial for Apple TV+, is now a tourist attraction, drawing fans to the architectural wonder. > > > > https://www.nytimes.com/2025/03/07/realestate/severance-lumon-industries-building-bell-labs.html?unlocked_article_code=1.2U4.9nGx.iMzMqZbSep6U&smid=em-share From jon at fourwinds.com Sun Mar 9 07:17:42 2025 From: jon at fourwinds.com (Jon Steinhart) Date: Sat, 08 Mar 2025 13:17:42 -0800 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: <202503082117.528LHgpV148779@darkstar.fourwinds.com> Rob Pike writes: > As someone who worked there occasionally, I can say that it's an > amazing building but from a workplace perspective full of > architectural ideas that simply do not work in practice. No office has > a window - the outer walkways take all the windows, and all the > offices face a transverse corridor. (In a later ridiculously expensive > expansion for a building Saarinen said could never be expanded, they > gave the president's office a window on the end. You might have seen > it in the show.) Only artificial light therefore.The huge central > atrium is cool but of course its floor was soon filled with cubicles, > ruining the effect and eventually requiring a netting over it to stop > people throwing things on the workers below. And so on. Great work > happened there but the building, externally at least a straightened TJ > Watson, was a failure in my amateur opinion. > > What a dreary workplace it was. > > But the approach is nice, at least until the geese took over and > became serious menaces to arrivals. > > It's a fitting locale for the show. No use of the geese though. Maybe > they've moved on. > > I will say that with that (I think) opening shot of series of the > water tower, I shrieked. > > -rob A couple of other Holmdel anecdotes. I fortunately was at Murray Hill but occasionally had to go down to Holmdel. When they first built the building the didn't think to use anti-reflective glass. They had to replace it because they were blinding motorists on the Garden State Parkway every day at sunset. If you were a latecomer to Holmdel you might wonder why they put so much money into that huge planter full of artificial plants in the atrium. It wasn't always that way. It used to have real plants and real dirt. But Roy, who's last name I can't remember, went up to the top floor balcony with a bespoke pea shooter and fired pot seeds into the planter which took. End of dirt. Unless my aging memory is confusing it with Whippany, I think that one of the cool things about the site was using the fountains at air conditioning chillers. Jon From aek at bitsavers.org Sun Mar 9 08:04:24 2025 From: aek at bitsavers.org (Al Kossow) Date: Sat, 8 Mar 2025 14:04:24 -0800 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> On 3/8/25 1:08 PM, Rob Pike wrote: > As someone who worked there occasionally, I can say that it's an > amazing building but from a workplace perspective full of > architectural ideas that simply do not work in practice. I've heard the same from people working in the Apple Mothership. From ggm at algebras.org Sun Mar 9 08:57:37 2025 From: ggm at algebras.org (George Michaelson) Date: Sun, 9 Mar 2025 08:57:37 +1000 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> References: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> Message-ID: Steve Deering spoke fondly of the PARC building, it's possible some radical architectural ideas work? If i recall the sense of his words it was democratising, communal in spirit. G -------------- next part -------------- An HTML attachment was scrubbed... URL: From robpike at gmail.com Sun Mar 9 10:51:49 2025 From: robpike at gmail.com (Rob Pike) Date: Sun, 9 Mar 2025 11:51:49 +1100 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> Message-ID: I visited PARC a few times and found it salubrious. The culture was peculiar (not necessarily in a bad way, but I didn't understand yet how SV worked (literally)), but the building was cosy and the people seemed happy. I sometimes wonder whether the reverence we give to architects is fully earned. -rob On Sun, Mar 9, 2025 at 10:07 AM George Michaelson wrote: > > Steve Deering spoke fondly of the PARC building, it's possible some radical architectural ideas work? If i recall the sense of his words it was democratising, communal in spirit. > > G From rminnich at gmail.com Sun Mar 9 11:03:11 2025 From: rminnich at gmail.com (ron minnich) Date: Sat, 8 Mar 2025 17:03:11 -0800 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: Part of my 1983 interview trip to Bell Labs (Allentown, Piscataway, MH, Holmdel, I recall) was at Holmdel, and the engineers there had little good to say about the building. My host apologized for the Holmdel offices. Holmdel, designed by a World Famous Architect, with Big Ideas, reminds me of an old rule: invariably, the ugliest building on campus is the architecture college. That certainly is true at UCB, but applies to many universities I've seen. ron p.s. I was visiting the Empire right before dissolution, and they had to explain the yellow tapes in Allentown that were going up in the hallways, marking the future BellCore/ATT boundaries. At that time, 1983, more semiconductors were made in PA, by Western Electric, than anywhere. The Bell System had a voracious appetite for hardware. p.p.s. Nope, didn't go there. One of my hosts gave me good advice: "don't come here, just go get your PhD." So I did. On Sat, Mar 8, 2025 at 1:22 PM Peter Weinberger (温博格) via TUHS < tuhs at tuhs.org> wrote: > Rob is overly generous about the space. But great work did happen there. > > On Sat, Mar 8, 2025 at 4:08 PM Rob Pike wrote: > > > > As someone who worked there occasionally, I can say that it's an > > amazing building but from a workplace perspective full of > > architectural ideas that simply do not work in practice. No office has > > a window - the outer walkways take all the windows, and all the > > offices face a transverse corridor. (In a later ridiculously expensive > > expansion for a building Saarinen said could never be expanded, they > > gave the president's office a window on the end. You might have seen > > it in the show.) Only artificial light therefore.The huge central > > atrium is cool but of course its floor was soon filled with cubicles, > > ruining the effect and eventually requiring a netting over it to stop > > people throwing things on the workers below. And so on. Great work > > happened there but the building, externally at least a straightened TJ > > Watson, was a failure in my amateur opinion. > > > > What a dreary workplace it was. > > > > But the approach is nice, at least until the geese took over and > > became serious menaces to arrivals. > > > > It's a fitting locale for the show. No use of the geese though. Maybe > > they've moved on. > > > > I will say that with that (I think) opening shot of series of the > > water tower, I shrieked. > > > > -rob > > > > On Sun, Mar 9, 2025 at 2:47 AM Rich Salz wrote: > > > > > > Gift article about Holmdel facility... > > > > > > Bell Works, the setting of the hit serial for Apple TV+, is now a > tourist attraction, drawing fans to the architectural wonder. > > > > > > > https://www.nytimes.com/2025/03/07/realestate/severance-lumon-industries-building-bell-labs.html?unlocked_article_code=1.2U4.9nGx.iMzMqZbSep6U&smid=em-share > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sauer at technologists.com Sun Mar 9 11:08:38 2025 From: sauer at technologists.com (Charles H Sauer (he/him)) Date: Sat, 8 Mar 2025 19:08:38 -0600 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> Message-ID: I really liked working in IBM Building 801, and Saarinen might deserve some of the credit, but the people, the semi-rural environment, the pleasant bicycle commute (https://technologists.com/sauer/songs/swans.html), working from home half the time in our 140 year old farm house with view of the Hudson, ... all probably deserve more of the credit. I've worked in some other elegant office buildings, e.g., the Dell Arboretum Point building in 1989-90, but the best times at Dell were after we (development) moved to one-story tilt wall structures in an industrial park. Again, it wasn't the building that mattered, but the people, the work, working from home, ... Charlie On 3/8/2025 6:51 PM, Rob Pike wrote: > I visited PARC a few times and found it salubrious. The culture was > peculiar (not necessarily in a bad way, but I didn't understand yet > how SV worked (literally)), but the building was cosy and the people > seemed happy. > > I sometimes wonder whether the reverence we give to architects is fully earned. > > -rob > > On Sun, Mar 9, 2025 at 10:07 AM George Michaelson wrote: >> >> Steve Deering spoke fondly of the PARC building, it's possible some radical architectural ideas work? If i recall the sense of his words it was democratising, communal in spirit. >> >> G -- voice: +1.512.784.7526 e-mail: sauer at technologists.com fax: +1.512.346.5240 Web: https://technologists.com/sauer/ Facebook/Google/LinkedIn/mas.to: CharlesHSauer From lm at mcvoy.com Sun Mar 9 11:11:09 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sat, 8 Mar 2025 17:11:09 -0800 Subject: [TUHS] Step Into the Real-Life Lumon Industries, the Breakout Star of ???Severance??? In-Reply-To: References: <2b9ac112-d1b2-9f9c-52cc-c30a7389317b@bitsavers.org> Message-ID: <20250309011109.GE24601@mcvoy.com> Welp, you made me look up salubrious. That's a pretty neat word, I had not seen that one before and I read a lot. So thanks for that. I sort of worked for a PARC guy and the word fits what little I know of PARC. On Sun, Mar 09, 2025 at 11:51:49AM +1100, Rob Pike wrote: > I visited PARC a few times and found it salubrious. The culture was > peculiar (not necessarily in a bad way, but I didn't understand yet > how SV worked (literally)), but the building was cosy and the people > seemed happy. > > I sometimes wonder whether the reverence we give to architects is fully earned. > > -rob > > On Sun, Mar 9, 2025 at 10:07???AM George Michaelson wrote: > > > > Steve Deering spoke fondly of the PARC building, it's possible some radical architectural ideas work? If i recall the sense of his words it was democratising, communal in spirit. > > > > G -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From stewart at serissa.com Sun Mar 9 11:30:30 2025 From: stewart at serissa.com (Serissa) Date: Sat, 8 Mar 2025 20:30:30 -0500 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: Message-ID: <92308B26-EC3B-4945-8ECC-5312DE52ED7F@serissa.com> > On Mar 8, 2025, at 7:52 PM, Rob Pike wrote: > > I visited PARC a few times and found it salubrious. The culture was > peculiar (not necessarily in a bad way, but I didn't understand yet > how SV worked (literally)), but the building was cosy and the people > seemed happy. I worked in the PARC building as a grad student and later as research staff. I liked the building, not least because one could bike there from campus or houses in Palo Alto. As for the culture, I think the book "Dealers of Lightning" is fairly accurate. In the 70's and 80's there was little connection between PARC and the rest of Silicon Valley. PARC had lots of contacts and visitors from Stanford, Berkeley, MIT, CMU, Cambridge UK, ETH, and other schools, but not all that much interchange with other SV companies. Connections were stronger with Xerox in LA as they tried to make a go of products. I was especially fond of the PARC softball leagues and trips to the Goose in Menlo Park. After a bunch of folks (including me) left to join Digital research labs in Palo Alto, there were still more connections with universities and with Digital back east than with the rest of the valley. There was little contact with other industrial research except for program committees. As for the Unix Way (tm) I think the folks at PARC were honestly puzzled, if they thought about it at all. Most were Tenex sorts of folks, or interested in languages, GUIs, and distributed computing. Unix was time sharing, and something you did if you didn't have your own computer. -L From robpike at gmail.com Sun Mar 9 12:05:05 2025 From: robpike at gmail.com (Rob Pike) Date: Sun, 9 Mar 2025 13:05:05 +1100 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: <92308B26-EC3B-4945-8ECC-5312DE52ED7F@serissa.com> References: <92308B26-EC3B-4945-8ECC-5312DE52ED7F@serissa.com> Message-ID: > > As for the Unix Way (tm) I think the folks at PARC were honestly puzzled, > if they thought about it at all. Most were Tenex sorts of folks, or > interested in languages, GUIs, and distributed computing. Unix was time > sharing, and something you did if you didn't have your own computer. I was at PARC in 1984, working with Dan Ingalls. I mentioned I was surprised that Smalltalk had no concurrency†, that the UI (let alone the system) was completely single-threaded. Only the window with focus could execute any code. Dan being Dan, he immediately got to work making a form of concurrency happen, followed by a delightful orgy of researches playing with the new toy. I loved it. Because: sometimes in isolation you miss important things going on in the outside world. -rob † The starting idea for the Blit né Jerq‡ was bringing a UI to Unix that supported parallel execution, after a demo of the Three Rivers Perqs at Lucasfilm, an emulation of the Alto, and seeing only missed opportunities. ‡ The name "Jerq" was Lucasfilm's own moniker for the Perq, and we asked them for permission to use it, which they happily provided. -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Sun Mar 9 13:46:58 2025 From: crossd at gmail.com (Dan Cross) Date: Sat, 8 Mar 2025 22:46:58 -0500 Subject: [TUHS] What would early alternatives to C have been? Message-ID: As I mentioned in the discussion about C, it's easy to look back with a modern perspective and cast aspersions on C. But this got me thinking, what would possible alternatives have been? In the context of the very late 1960s heading into the early 70s, and given the constraints of the PDP-7 and early PDP-11s, what languages would one consider for implementing a system like early Unix? Dennis's history paper mentioned a very short-lived effort at Fortran, and I asked about that a few years ago, but no one really remembered much about it; I gather this was an experiment that lasted a few days or weeks and was quickly abandoned. But what else? My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was already mentioned. I don't think PL/1 (or PL/I) could have fit on those machines. Pascal was really targeted towards teaching and would have required pretty extensive work to be usable. The big question mark in my mind is Algol/W; how well known was it at the time? Was any consideration for it made? Obviously, the decision to go with BCPL (as the basis for B, which beget C) was made and the rest is history. But I'm really curious about how, in the research culture at the time, information about new programming languages made its way through the community. - Dan C. From tuhs at tuhs.org Sun Mar 9 14:14:51 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sat, 8 Mar 2025 20:14:51 -0800 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: References: <92308B26-EC3B-4945-8ECC-5312DE52ED7F@serissa.com> Message-ID: <76469579-7F8E-40B4-9F7B-63089211AE3D@iitbombay.org> On Mar 8, 2025, at 6:05 PM, Rob Pike wrote: > > I was at PARC in 1984, working with Dan Ingalls. I mentioned I was surprised that Smalltalk had no concurrency†, that the UI (let alone the system) was completely single-threaded. Only the window with focus could execute any code. Dan being Dan, he immediately got to work making a form of concurrency happen, followed by a delightful orgy of researches playing with the new toy. I loved it. > > Because: sometimes in isolation you miss important things going on in the outside world. Surely they must've read papers on concurrency & were aware of CSP, monitors, the Actor model etc? A few years ago at a dinner I had asked Don Knuth whether he was going to write any books on parallel algorithms. Alas, I don't recall his exact answer but he didn't seem keen on the idea -- I was a bit surprised but thinking more about it, it made sense. [Still would like to see someone attempt a Knuth style encyclopedic treatment to the subject of concurrent/parallel algorithms!] -------------- next part -------------- An HTML attachment was scrubbed... URL: From robpike at gmail.com Sun Mar 9 15:20:07 2025 From: robpike at gmail.com (Rob Pike) Date: Sun, 9 Mar 2025 16:20:07 +1100 Subject: [TUHS] =?utf-8?q?Step_Into_the_Real-Life_Lumon_Industries=2C_the?= =?utf-8?b?IEJyZWFrb3V0IFN0YXIgb2Yg4oCYU2V2ZXJhbmNl4oCZ?= In-Reply-To: <76469579-7F8E-40B4-9F7B-63089211AE3D@iitbombay.org> References: <92308B26-EC3B-4945-8ECC-5312DE52ED7F@serissa.com> <76469579-7F8E-40B4-9F7B-63089211AE3D@iitbombay.org> Message-ID: I'm sure they knew about that, but had never considered the consequences for user interfaces. -rob On Sun, Mar 9, 2025 at 3:15 PM Bakul Shah wrote: > On Mar 8, 2025, at 6:05 PM, Rob Pike wrote: > > > I was at PARC in 1984, working with Dan Ingalls. I mentioned I was > surprised that Smalltalk had no concurrency†, that the UI (let alone the > system) was completely single-threaded. Only the window with focus could > execute any code. Dan being Dan, he immediately got to work making a form > of concurrency happen, followed by a delightful orgy of researches playing > with the new toy. I loved it. > > Because: sometimes in isolation you miss important things going on in the > outside world. > > > Surely they must've read papers on concurrency & were aware of CSP, > monitors, the Actor model etc? > > A few years ago at a dinner I had asked Don Knuth whether he was going to > write any books on parallel algorithms. Alas, I don't recall his exact > answer but he didn't seem keen on the idea -- I was a bit surprised but > thinking more about it, it made sense. [Still would like to see someone > attempt a Knuth style encyclopedic treatment to the subject of > concurrent/parallel algorithms!] > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ggm at algebras.org Sun Mar 9 16:14:26 2025 From: ggm at algebras.org (George Michaelson) Date: Sun, 9 Mar 2025 16:14:26 +1000 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: Bliss. After all, DEC stuck to it either Bliss 32. On Sun, 9 Mar 2025, 1:47 pm Dan Cross, wrote: > As I mentioned in the discussion about C, it's easy to look back with > a modern perspective and cast aspersions on C. But this got me > thinking, what would possible alternatives have been? In the context > of the very late 1960s heading into the early 70s, and given the > constraints of the PDP-7 and early PDP-11s, what languages would one > consider for implementing a system like early Unix? Dennis's history > paper mentioned a very short-lived effort at Fortran, and I asked > about that a few years ago, but no one really remembered much about > it; I gather this was an experiment that lasted a few days or weeks > and was quickly abandoned. But what else? > > My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was > already mentioned. I don't think PL/1 (or PL/I) could have fit on > those machines. Pascal was really targeted towards teaching and would > have required pretty extensive work to be usable. The big question > mark in my mind is Algol/W; how well known was it at the time? Was any > consideration for it made? > > Obviously, the decision to go with BCPL (as the basis for B, which > beget C) was made and the rest is history. But I'm really curious > about how, in the research culture at the time, information about new > programming languages made its way through the community. > > - Dan C. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Sun Mar 9 22:29:43 2025 From: clemc at ccc.com (Clem Cole) Date: Sun, 9 Mar 2025 08:29:43 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: ESPOL was basically first, with BCPL, PL/360, and, as George mentioned, BLISS showing up soon thereafter. On Sat, Mar 8, 2025 at 10:47 PM Dan Cross wrote: > The big question mark in my mind is Algol/W; how well known was it at the > time? Any 360 shop, particularly if it was targeted for teaching students, would likely have had Wirth's compiler. Remember, as BCPL was to CPL, PL/360 was to Algol-W. Unlike BCPL, I don't know of any port of PL/360 outside of the IBM world. Algol-W would later be implemented in C, but that was later after Wirth tried again with Pascal. > Was any consideration for it made? > Only Ken can answer that. My guess is that Algol-W would have failed for the same reasons Pascal ultimately failed. It really was directed at students and lacked many of the tools C had. Many of the issues exposed in Brian's treatise *"Why is Pascal not my favorite programming language"* also apply to Algol-W. At the time, BLISS had the advantage of the first "Green Book" style optimizer, so in comparison to the original B and later C compilers, their code generators were almost toys. I believe that the killer for BLISS was DEC's choice to charge $5K per/cpu and to few places were willing to pay it. But as Paul points out BLISS was word-oriented, which, from a technical point of view, made it more difficult to use than C. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From arnold at skeeve.com Sun Mar 9 22:49:17 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Sun, 09 Mar 2025 06:49:17 -0600 Subject: [TUHS] BWK on possible alternative programming languages Message-ID: <202503091249.529CnHXt105980@freefriends.org> I asked BWK if he had any thoughts about possible alternative languages. Here is his response, forwarded by permission. Arnold > Date: Sun, 9 Mar 2025 08:27:57 -0400 (EDT) > From: Brian Kernighan > To: arnold at skeeve.com > cc: crossd at gmail.com, Brian Kernighan > Subject: Re: An interesting history question > > Dan raises an interesting question. I don't have a good answer, > but there are possibilities. > > Typeless languages like BCPL were in the air; Bliss, from CMU in > 1970, was a significant example, used mostly on the PDP-10 but it > could run on a PDP-11. It was definitely a contender for doing > systems work. > > I used MAD in the summer of 1966 at MIT and remembered it as being > much nicer than Fortran, though when I looked at a description a > while ago, it wasn't clear what the attraction was. > > Bell Labs (Doug McIlroy and Bob Morris, mostly) made a PL/I subset > called EPL that was at least compilable and a lot easier to manage > than the full language. I don't know whether that would have > worked, but it would seem that Ken didn't think so, since he went > off on his own direction. Doug would know more; he sent me some > corrective info a month ago, on the errata page here: > > https://www.cs.princeton.edu/~bwk/memoir.html > > Fortran would have needed major work to handle non-numeric data. > I wrote a text formatter in it by hacking with the Logical*1 type; > that let me handle one character at a time by basically lying, > though I've long since forgotten the details. > > Pascal was hopeless, as I have described elsewhere, though > variants that repaired some of the type system might have worked. > > The US military used Jovial; it sounds like it's still sort of in > use, since it handles the avionics in a lot of planes. It looks > like a direct descendant of Algol 58. > > I never used Algol/W, but of all the options, it seems like it > might have been the strongest contender. > > Xerox PARC had Mesa, but my dim memory is that it was big and > complicated, which is the opposite of what was needed at the time. > It also came along too late, mid to late 1970s. It did influence > Java and Modula-2, says Wikipedia. > > HOPL 1 includes papers on other languages of the time, most of > which would not have worked, and/or have died by now. There's a > lot of history, and I have no idea how to get on top of it all. > But still interesting to look at and speculate about. > > Brian > > > On Sat, 8 Mar 2025, arnold at skeeve.com wrote: > > > Hi Brian. > > > > Any thoughts on this? > > > > (cc-ing Dan, the original poster) > > > > Thanks, > > > > Arnold > > > >> From: Dan Cross > >> Date: Sat, 8 Mar 2025 22:46:58 -0500 > >> To: TUHS > >> Subject: [TUHS] What would early alternatives to C have been? > >> > >> As I mentioned in the discussion about C, it's easy to look back with > >> a modern perspective and cast aspersions on C. But this got me > >> thinking, what would possible alternatives have been? In the context > >> of the very late 1960s heading into the early 70s, and given the > >> constraints of the PDP-7 and early PDP-11s, what languages would one > >> consider for implementing a system like early Unix? Dennis's history > >> paper mentioned a very short-lived effort at Fortran, and I asked > >> about that a few years ago, but no one really remembered much about > >> it; I gather this was an experiment that lasted a few days or weeks > >> and was quickly abandoned. But what else? > >> > >> My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was > >> already mentioned. I don't think PL/1 (or PL/I) could have fit on > >> those machines. Pascal was really targeted towards teaching and would > >> have required pretty extensive work to be usable. The big question > >> mark in my mind is Algol/W; how well known was it at the time? Was any > >> consideration for it made? > >> > >> Obviously, the decision to go with BCPL (as the basis for B, which > >> beget C) was made and the rest is history. But I'm really curious > >> about how, in the research culture at the time, information about new > >> programming languages made its way through the community. > >> > >> - Dan C. > >> > > > From g.branden.robinson at gmail.com Sun Mar 9 23:18:31 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Sun, 9 Mar 2025 08:18:31 -0500 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: <20250309131831.ol553bn3fo7jdcjb@illithid> At 2025-03-09T08:29:43-0400, Clem Cole wrote: > On Sat, Mar 8, 2025 at 10:47 PM Dan Cross wrote: > > Was any consideration for it made? > > > Only Ken can answer that. My guess is that Algol-W would have failed > for the same reasons Pascal ultimately failed. It really was > directed at students and lacked many of the tools C had. Many of the > issues exposed in Brian's treatise *"Why is Pascal not my favorite > programming language"* also apply to Algol-W. I think it's worth noting that, as I undertand it from having read the paper, old Pascal manuals, and ISO 7185 (and, a long time ago, experience with Borland Turbo Pascal), most or all of the defects Kernighan pointed out were addressed by vendored versions of Pascal and ultimately by its ISO standard form. I think even UCSD Pascal had extensions, though I don't recall what they were. My grasp of the chronology is poor, though, and I'd concede that Wirth's Pascal of his 1973 memorandum wears most or all of the mud Kernighan threw at it. For the most part, solving the problems Kerngihan observed did not require doing violence to the language (in other words, they could be solved with extensions rather than by altering the semantics of valid 1973 Pascal syntax). Maybe this is what it means to have been "directed at students"--that easy wins in terms of language improvement were not implemented because covering them topically in a semester-long university course would not have been practical. If so, that may make the tragedy of Pascal greater, not smaller. Though even that doesn't explain why Wirth didn't spec Pascal functions to admit polymorphism (perhaps optionally) in array length. That's an essential property for all sorts of work, even that undertaken by students. (Say you want them to write a function that multiplies matrices. You shouldn't have to specialize the function by matrix dimensions. The obvious pinch point for a Unix programmer is massive inconvenience if you want to handle variable-length strings.) Perusing its manual some months ago, it occurred to me that if someone had had the foresight to implement the language core of Turbo Pascal 7.0, ignoring the libraries/"modules" entirely, and omitting the crap necessary to work with the x86 segmented memory model, I don't think Kernighan would have had much, if anything, to complain about. And, in my opinion, we'd be living in a better timeline. But I guess Pascal just didn't get good enough, soon enough. And I wonder how big such a compiler would be, even if coded competently. Chronology bedevils us in another way. A _lot_ of C language features its advocates are proud of were not present in 1973 or even when Kernighan wrote CSTR #100 around 1980. It is an error to substitute the ANSI C of 1989 for the C that actually existed at the time when asking who'd win in a fight between C and Pascal taking place in, say, 1977. https://gunkies.org/wiki/Typesetter_C Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From tuhs at tuhs.org Mon Mar 10 02:59:06 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sun, 9 Mar 2025 09:59:06 -0700 Subject: [TUHS] BWK on possible alternative programming languages In-Reply-To: <202503091249.529CnHXt105980@freefriends.org> References: <202503091249.529CnHXt105980@freefriends.org> Message-ID: <77548CFF-CE07-4498-B9A6-BD5A5BA31BD3@iitbombay.org> Pascal was heavily influenced by Algol/W. From wikipedia: ALGOL W served as the basis for the Pascal language, and the syntax of ALGOL W will be immediately familiar to anyone with Pascal experience. The key differences are improvements to record handling in Pascal, and, oddly, the loss of ALGOL W's ability to define the length of an array at runtime, which is one of Pascal's most-complained-about features. Algol/w or Pascal, with a few extensions would have been perfectly fine (& arguably a better choice than C) for implementing an OS. IMHO. Algol/W might even have been better but it was perhaps "too big" for small computers at the time? Pascal was simpler. Algol/W reference manual: https://www.softwarepreservation.org/projects/ALGOL/manual/Eve-AlgolW_Manual-1972.pdf Looking at it now, it had strings, dynamic data structures, complex numbers, extended precision numbers, call by name, block expressions (like gnu CC's { .... ; value }) and so on. > On Mar 9, 2025, at 5:49 AM, arnold at skeeve.com wrote: > > I asked BWK if he had any thoughts about possible alternative > languages. Here is his response, forwarded by permission. > > Arnold > >> Date: Sun, 9 Mar 2025 08:27:57 -0400 (EDT) >> From: Brian Kernighan >> To: arnold at skeeve.com >> cc: crossd at gmail.com, Brian Kernighan >> Subject: Re: An interesting history question >> >> Dan raises an interesting question. I don't have a good answer, >> but there are possibilities. >> >> Typeless languages like BCPL were in the air; Bliss, from CMU in >> 1970, was a significant example, used mostly on the PDP-10 but it >> could run on a PDP-11. It was definitely a contender for doing >> systems work. >> >> I used MAD in the summer of 1966 at MIT and remembered it as being >> much nicer than Fortran, though when I looked at a description a >> while ago, it wasn't clear what the attraction was. >> >> Bell Labs (Doug McIlroy and Bob Morris, mostly) made a PL/I subset >> called EPL that was at least compilable and a lot easier to manage >> than the full language. I don't know whether that would have >> worked, but it would seem that Ken didn't think so, since he went >> off on his own direction. Doug would know more; he sent me some >> corrective info a month ago, on the errata page here: >> >> https://www.cs.princeton.edu/~bwk/memoir.html >> >> Fortran would have needed major work to handle non-numeric data. >> I wrote a text formatter in it by hacking with the Logical*1 type; >> that let me handle one character at a time by basically lying, >> though I've long since forgotten the details. >> >> Pascal was hopeless, as I have described elsewhere, though >> variants that repaired some of the type system might have worked. >> >> The US military used Jovial; it sounds like it's still sort of in >> use, since it handles the avionics in a lot of planes. It looks >> like a direct descendant of Algol 58. >> >> I never used Algol/W, but of all the options, it seems like it >> might have been the strongest contender. >> >> Xerox PARC had Mesa, but my dim memory is that it was big and >> complicated, which is the opposite of what was needed at the time. >> It also came along too late, mid to late 1970s. It did influence >> Java and Modula-2, says Wikipedia. >> >> HOPL 1 includes papers on other languages of the time, most of >> which would not have worked, and/or have died by now. There's a >> lot of history, and I have no idea how to get on top of it all. >> But still interesting to look at and speculate about. >> >> Brian >> >> >> On Sat, 8 Mar 2025, arnold at skeeve.com wrote: >> >>> Hi Brian. >>> >>> Any thoughts on this? >>> >>> (cc-ing Dan, the original poster) >>> >>> Thanks, >>> >>> Arnold >>> >>>> From: Dan Cross >>>> Date: Sat, 8 Mar 2025 22:46:58 -0500 >>>> To: TUHS >>>> Subject: [TUHS] What would early alternatives to C have been? >>>> >>>> As I mentioned in the discussion about C, it's easy to look back with >>>> a modern perspective and cast aspersions on C. But this got me >>>> thinking, what would possible alternatives have been? In the context >>>> of the very late 1960s heading into the early 70s, and given the >>>> constraints of the PDP-7 and early PDP-11s, what languages would one >>>> consider for implementing a system like early Unix? Dennis's history >>>> paper mentioned a very short-lived effort at Fortran, and I asked >>>> about that a few years ago, but no one really remembered much about >>>> it; I gather this was an experiment that lasted a few days or weeks >>>> and was quickly abandoned. But what else? >>>> >>>> My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was >>>> already mentioned. I don't think PL/1 (or PL/I) could have fit on >>>> those machines. Pascal was really targeted towards teaching and would >>>> have required pretty extensive work to be usable. The big question >>>> mark in my mind is Algol/W; how well known was it at the time? Was any >>>> consideration for it made? >>>> >>>> Obviously, the decision to go with BCPL (as the basis for B, which >>>> beget C) was made and the rest is history. But I'm really curious >>>> about how, in the research culture at the time, information about new >>>> programming languages made its way through the community. >>>> >>>> - Dan C. >>>> >>> >> From rich.salz at gmail.com Mon Mar 10 03:05:28 2025 From: rich.salz at gmail.com (Rich Salz) Date: Sun, 9 Mar 2025 13:05:28 -0400 Subject: [TUHS] BWK on possible alternative programming languages In-Reply-To: <202503091249.529CnHXt105980@freefriends.org> References: <202503091249.529CnHXt105980@freefriends.org> Message-ID: On Sun, Mar 9, 2025, 8:50 AM wrote: > I asked BWK ... > I can't believe that after nearly 40 years I only just now saw the similarity between BWK and awk. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralph at inputplus.co.uk Mon Mar 10 03:21:22 2025 From: ralph at inputplus.co.uk (Ralph Corderoy) Date: Sun, 09 Mar 2025 17:21:22 +0000 Subject: [TUHS] BWK on possible alternative programming languages In-Reply-To: References: <202503091249.529CnHXt105980@freefriends.org> Message-ID: <20250309172122.2578521420@orac.inputplus.co.uk> Hi Rich, > > I asked BWK ... > > I can't believe that after nearly 40 years I only just now saw the > similarity between BWK and awk. Though AWK isn't named as BWK's earlier prototype, a --BWK, but after AWK's authors, Aho, Weinberger, and Kernighan, who also wrote the book ‘The AWK Programming Language’. -- Cheers, Ralph. From clemc at ccc.com Mon Mar 10 03:29:55 2025 From: clemc at ccc.com (Clem Cole) Date: Sun, 9 Mar 2025 13:29:55 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309131831.ol553bn3fo7jdcjb@illithid> References: <20250309131831.ol553bn3fo7jdcjb@illithid> Message-ID: On Sun, Mar 9, 2025 at 9:18 AM G. Branden Robinson < g.branden.robinson at gmail.com> wrote: > At 2025-03-09T08:29:43-0400, Clem Cole wrote: > > On Sat, Mar 8, 2025 at 10:47 PM Dan Cross wrote: > > > Was any consideration for it made? > > > > > Only Ken can answer that. My guess is that Algol-W would have failed > > for the same reasons Pascal ultimately failed. It really was > > directed at students and lacked many of the tools C had. Many of the > > issues exposed in Brian's treatise *"Why is Pascal not my favorite > > programming language"* also apply to Algol-W. > > I think it's worth noting that, as I undertand it from having read the > paper, old Pascal manuals, and ISO 7185 (and, a long time ago, > experience with Borland Turbo Pascal), most or all of the defects > Kernighan pointed out were addressed by vendored versions of Pascal and > ultimately by its ISO standard form. I think even UCSD Pascal had > extensions, though I don't recall what they were. > Branden — respectfully, if you download the standard https://archive.org/details/iso-iec-7185-1990-Pascal or look at versions that have been converted to HTML such as https://wiki.freepascal.org/Standard_Pascal, that statement is a tad hollow. Sadly, that is the issue with Pascal. The fundamental flaws of the core Pascal language remained as described in Brian’s paper as: 1. Since the size of an array is part of its type, it is not possible to write general-purpose routines, that is, to deal with arrays of different sizes. In particular, string handling is very difficult. 2. The lack of static variables, initialization, and a way to communicate non-hierarchically combine to destroy the ‘‘locality’’ of a program — variables require much more scope than they ought to. 3. The language's one-pass nature forces procedures and functions to be presented in an unnatural order; the enforced separation of various declarations scatters program components that logically belong together. 4. The lack of separate compilation impedes the development of large programs and makes using external libraries impossible. 5. The order of logical expression evaluation cannot be controlled, which leads to convoluted code and extraneous variables. 6. The case statement is emasculated because there is no default clause. 7. The standard I/O is defective. There is no sensible provision for dealing with files or program arguments as part of the standard language and no extension mechanism. 8. The language lacks most of the tools needed for assembling large programs, most notably file inclusion. 9. There is no escape. > My grasp of the chronology is poor, though, and I'd concede that Wirth's > Pascal of his 1973 memorandum wears most or all of the mud Kernighan > threw at it. For the most part, solving the problems Kerngihan observed > did not require doing violence to the language (in other words, they > could be solved with extensions rather than by altering the semantics of > valid 1973 Pascal syntax). > You >>are<< correct that many (not all) of the defects Brian and Dave ran into when updating/rewriting the Software Tools book from FORTRAN (Ratfor) to Pascal >>could<< have been fixed. The core issue with Pascal is that if an implementation did add an extension,* every implementation did it differently.* Now, the truth is that having different implementations from different vendors was not unusual. The whole reason ASA developed FORTRAN-66, which begat the many versions that followed [I believe FORTRAN-2018 is the last verified version, but I'm told by Dr. FORTRAN [Steve Lionel] that they are preparing an updated draft. BASIC had it in spades. In fact, at one of the early 'Hatfield/McCoy" parties between my friends at HP and Tektronix, we count 14 different flavors of "HP BASIC" and 6 different flavors of "Tek Pascal." > > Maybe this is what it means to have been "directed at students"--that > easy wins in terms of language improvement were not implemented because > covering them topically in a semester-long university course would not > have been practical. If so, that may make the tragedy of Pascal > greater, not smaller. > History. Wirth was teaching at Stanford when Algol-X was proposed. In the late 1960s, the International Federation for Information Processing (IFIP) held the International responsibility for the definition of the programming language ALGOL 60, chartered IFIP Working Group 2.1 to develop a replacement for the language which had been identified, notably the lack of a standardized string subsystem. This effort was dubbed ALGOL X. You can use an internet search to find several of them, but one of the proposals was submitted by Niklaus Wirth and Tony Hoare. When their proposal was not accepted, the language ALGOL 68 would be developed under these auspices. The committee decided that the Wirth/Hoare proposal was an insufficient advance over ALGOL 60, and the proposal was published as a contribution to the development of ALGOL in CACM after making slight modifications to the language. Wirth eventually decided to develop his proposal and created an implementation targeting the IBM System/360 at Stanford University. He called the new language Algol-W [used in many universities, and I used it at CMU, and I have friends that used it at Cornell and Princeton]. At the time, Wirth's original target was a tool to teach his students at Stanford. It was not a "production quality" compiler in the sense of IBM's FORTRAN-H, which was considered the "best" production compiler at the time. Also, remember the whole assembler/compiled code was hardly a settled debate. Wulf's findings showed BLISS was as good as the best PDP-10/PDP-11 programmers, which was still a few years in the future. As I've said elsewhere, it took enough students trained in the ideas from the Green Book a few years to get into the world to start to develop the optimizers we now expect as the norm. > > Though even that doesn't explain why Wirth didn't spec Pascal functions > to admit polymorphism (perhaps optionally) in array length. That's an > essential property for all sorts of work, even that undertaken by > students. (Say you want them to write a function that multiplies > matrices. You shouldn't have to specialize the function by matrix > dimensions. The obvious pinch point for a Unix programmer is massive > inconvenience if you want to handle variable-length strings.) > Again, it was not considered an issue for what they were trying to solve. Read the IFPS meeting notes. > > Perusing its manual some months ago, it occurred to me that if someone > had had the foresight to implement the language core of Turbo Pascal > 7.0, ignoring the libraries/"modules" entirely, and omitting the crap > necessary to work with the x86 segmented memory model, I don't think > Kernighan would have had much, if anything, to complain about. And, in > my opinion, we'd be living in a better timeline. > Turbo Pascal was a flavor of Pascal that came much later. At the time, UCB Pascal and UCSD Pascal were the two popular implementations for students, and commercial folks used Whitesmiths and, later, OMSI. The first three were the compilers Brian and PJ used (I believe all were running a V7 Unix, but I'm not positive about the target OS). FWIW: today's popular Pascal is Free Pascal (freepascal.org), and it accepts all of the syntax of Turbo, Object Pascal, and Delphi as input. But it was too late. > > But I guess Pascal just didn't get good enough, soon enough. > No, it was not agreed upon. Everyone developing a Pascal implementation was pushing their flavor (sound familiar). Turbo "won" when they targeted DOS, which, at the time, was being ignored by the traditional "professional" programmers (DOS was too much of a "toy" to be serious. - Minicomputers were the bread and butter). When PharLap created an OS extension that allowed for a real 32-bit C for the 386, UNIX and C had already taken the mantel, and Pascal was losing favor in the DOS world. > > And I wonder how big such a compiler would be, even if coded > competently. > Look at OMSI, which primarily targeted the PDP-11. It has many of these extensions [it basically started as one of the Tek Pascal's, BTW. Jon and I worked with some folks who developed it.] > > Chronology bedevils us in another way. A _lot_ of C language features > its advocates are proud of were not present in 1973 or even when > Kernighan wrote CSTR #100 around 1980. Please be careful here. The point is that in 1980, Pascal and C, as described by Brain, C did not have the issues I listed above, but the popular Pascal implementations being used in the wild did. > It is an error to substitute the ANSI C of 1989 for the C that actually > existed at the time when asking > who'd win in a fight between C and Pascal taking place in, say, 1977. > I don't since I lived it. > > https://gunkies.org/wiki/Typesetter_C Need to work with Noel to update that page -- that name was because it came with the original ditroff - which used libS.a to replace Lesk's portable I/O library. It was less an issue of new compiler language features and more the requirement for a new I/O library. Remember, C and BLISS, as examples are defined without an I/O library in their origin stories. They both use the local OS's IO interface. C does not formally get an I/O library until Typesetter C and White Book. Simply, until C moved to the Honeywell and 360 nd people started to "port" code, that Lesk started to codify an IO library that could be used regardless of the local OS. Look at the documentation in V6. You'll see some interesting like fd = -1 and other artifacts that Mike had in so that earlier code would recompile and relink. With typesetter C, Dennis forced a small rewrite of your code to use libS.a. As I said, I lived this whole war. Crumudgingly yours, Clem ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenbob at gmail.com Mon Mar 10 05:06:12 2025 From: kenbob at gmail.com (Ken Thompson) Date: Sun, 9 Mar 2025 12:06:12 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> Message-ID: how about smalgol? it was an algol-like language with just int and float types. i dont know its history, but it came out of berkeley near when Niklaus Wirth was there. it compiled for the ibm 7094 in normal batch processing fashion. i converted it to a jit into memory in order to skip the loading phase. i used it for a lot of my fun-work. (1965-66) mainframe time, then, was a big factor in the computing process. smalgol could compile, load, and run in about 1 cpu-second. smalgol was all ibm-cards, but it was on my mind through the bcpl to b to nb phases. i would use the modern word "influencer." On Sun, Mar 9, 2025 at 10:37 AM Clem Cole wrote: > > > On Sun, Mar 9, 2025 at 9:18 AM G. Branden Robinson < > g.branden.robinson at gmail.com> wrote: > >> At 2025-03-09T08:29:43-0400, Clem Cole wrote: >> > On Sat, Mar 8, 2025 at 10:47 PM Dan Cross wrote: >> > > Was any consideration for it made? >> > > >> > Only Ken can answer that. My guess is that Algol-W would have failed >> > for the same reasons Pascal ultimately failed. It really was >> > directed at students and lacked many of the tools C had. Many of the >> > issues exposed in Brian's treatise *"Why is Pascal not my favorite >> > programming language"* also apply to Algol-W. >> >> I think it's worth noting that, as I undertand it from having read the >> paper, old Pascal manuals, and ISO 7185 (and, a long time ago, >> experience with Borland Turbo Pascal), most or all of the defects >> Kernighan pointed out were addressed by vendored versions of Pascal and >> ultimately by its ISO standard form. I think even UCSD Pascal had >> extensions, though I don't recall what they were. >> > Branden — respectfully, if you download the standard > https://archive.org/details/iso-iec-7185-1990-Pascal or look at versions > that have been converted to HTML such as > https://wiki.freepascal.org/Standard_Pascal, that statement is a tad > hollow. Sadly, that is the issue with Pascal. The fundamental flaws of > the core Pascal language remained as described in Brian’s paper as: > > 1. Since the size of an array is part of its type, it is not possible to > write general-purpose routines, that is, to deal with arrays of different > sizes. In particular, string handling is very difficult. > > 2. The lack of static variables, initialization, and a way to communicate > non-hierarchically combine to destroy the ‘‘locality’’ of a program — > variables require much more scope than they ought to. > > 3. The language's one-pass nature forces procedures and functions to be > presented in an unnatural order; the enforced separation of various > declarations scatters program components that logically belong together. > > 4. The lack of separate compilation impedes the development of large > programs and makes using external libraries impossible. > > 5. The order of logical expression evaluation cannot be controlled, which > leads to convoluted code and extraneous variables. > > 6. The case statement is emasculated because there is no default clause. > > 7. The standard I/O is defective. There is no sensible provision for > dealing with files or program arguments as part of the standard language > and no extension mechanism. > > 8. The language lacks most of the tools needed for assembling large > programs, most notably file inclusion. > > 9. There is no escape. > > > >> My grasp of the chronology is poor, though, and I'd concede that Wirth's >> Pascal of his 1973 memorandum wears most or all of the mud Kernighan >> threw at it. For the most part, solving the problems Kerngihan observed >> did not require doing violence to the language (in other words, they >> could be solved with extensions rather than by altering the semantics of >> valid 1973 Pascal syntax). >> > You >>are<< correct that many (not all) of the defects Brian and Dave ran > into when updating/rewriting the Software Tools book from FORTRAN (Ratfor) > to Pascal >>could<< have been fixed. The core issue with Pascal is that > if an implementation did add an extension,* every implementation did it > differently.* > > Now, the truth is that having different implementations from different > vendors was not unusual. The whole reason ASA developed FORTRAN-66, which > begat the many versions that followed [I believe FORTRAN-2018 is the last > verified version, but I'm told by Dr. FORTRAN [Steve Lionel] that they are > preparing an updated draft. BASIC had it in spades. In fact, at one of the > early 'Hatfield/McCoy" parties between my friends at HP and Tektronix, we > count 14 different flavors of "HP BASIC" and 6 different flavors of "Tek > Pascal." > >> >> Maybe this is what it means to have been "directed at students"--that >> easy wins in terms of language improvement were not implemented because >> covering them topically in a semester-long university course would not >> have been practical. If so, that may make the tragedy of Pascal >> greater, not smaller. >> > History. Wirth was teaching at Stanford when Algol-X was proposed. In > the late 1960s, the International Federation for Information Processing > (IFIP) held the International responsibility for the definition of the > programming language ALGOL 60, chartered IFIP Working Group 2.1 to develop > a replacement for the language which had been identified, notably the lack > of a standardized string subsystem. This effort was dubbed ALGOL X. You > can use an internet search to find several of them, but one of the > proposals was submitted by Niklaus Wirth and Tony Hoare. When their > proposal was not accepted, the language ALGOL 68 would be developed under > these auspices. > > The committee decided that the Wirth/Hoare proposal was an insufficient > advance over ALGOL 60, and the proposal was published as a contribution to > the development of ALGOL in CACM after making slight modifications to the > language. Wirth eventually decided to develop his proposal and created an > implementation targeting the IBM System/360 at Stanford University. He > called the new language Algol-W [used in many universities, and I used it > at CMU, and I have friends that used it at Cornell and Princeton]. > > At the time, Wirth's original target was a tool to teach his students at > Stanford. It was not a "production quality" compiler in the sense of IBM's > FORTRAN-H, which was considered the "best" production compiler at the time. > Also, remember the whole assembler/compiled code was hardly a settled > debate. Wulf's findings showed BLISS was as good as the best PDP-10/PDP-11 > programmers, which was still a few years in the future. As I've said > elsewhere, it took enough students trained in the ideas from the Green Book > a few years to get into the world to start to develop the optimizers we now > expect as the norm. > >> >> Though even that doesn't explain why Wirth didn't spec Pascal functions >> to admit polymorphism (perhaps optionally) in array length. That's an >> essential property for all sorts of work, even that undertaken by >> students. (Say you want them to write a function that multiplies >> matrices. You shouldn't have to specialize the function by matrix >> dimensions. The obvious pinch point for a Unix programmer is massive >> inconvenience if you want to handle variable-length strings.) >> > Again, it was not considered an issue for what they were trying to solve. > Read the IFPS meeting notes. > >> >> Perusing its manual some months ago, it occurred to me that if someone >> had had the foresight to implement the language core of Turbo Pascal >> 7.0, ignoring the libraries/"modules" entirely, and omitting the crap >> necessary to work with the x86 segmented memory model, I don't think >> Kernighan would have had much, if anything, to complain about. And, in >> my opinion, we'd be living in a better timeline. >> > Turbo Pascal was a flavor of Pascal that came much later. At the time, UCB > Pascal and UCSD Pascal were the two popular implementations for students, > and commercial folks used Whitesmiths and, later, OMSI. The first three > were the compilers Brian and PJ used (I believe all were running a V7 Unix, > but I'm not positive about the target OS). > > FWIW: today's popular Pascal is Free Pascal (freepascal.org), and it > accepts all of the syntax of Turbo, Object Pascal, and Delphi as input. > But it was too late. > >> >> But I guess Pascal just didn't get good enough, soon enough. >> > No, it was not agreed upon. Everyone developing a Pascal implementation > was pushing their flavor (sound familiar). Turbo "won" when they targeted > DOS, which, at the time, was being ignored by the traditional > "professional" programmers (DOS was too much of a "toy" to be serious. - > Minicomputers were the bread and butter). When PharLap created an OS > extension that allowed for a real 32-bit C for the 386, UNIX and C had > already taken the mantel, and Pascal was losing favor in the DOS world. > >> >> And I wonder how big such a compiler would be, even if coded >> competently. >> > Look at OMSI, which primarily targeted the PDP-11. It has many of these > extensions [it basically started as one of the Tek Pascal's, BTW. Jon > and I worked with some folks who developed it.] > >> >> Chronology bedevils us in another way. A _lot_ of C language features >> its advocates are proud of were not present in 1973 or even when >> Kernighan wrote CSTR #100 around 1980. > > Please be careful here. The point is that in 1980, Pascal and C, as > described by Brain, C did not have the issues I listed above, but the > popular Pascal implementations being used in the wild did. > > > >> It is an error to substitute the ANSI C of 1989 for the C that actually >> existed at the time when asking >> who'd win in a fight between C and Pascal taking place in, say, 1977. >> > I don't since I lived it. > >> >> https://gunkies.org/wiki/Typesetter_C > > Need to work with Noel to update that page -- that name was because it > came with the original ditroff - which used libS.a to replace Lesk's > portable I/O library. It was less an issue of new compiler language > features and more the requirement for a new I/O library. Remember, C and > BLISS, as examples are defined without an I/O library in their origin > stories. They both use the local OS's IO interface. C does not formally > get an I/O library until Typesetter C and White Book. Simply, until C > moved to the Honeywell and 360 nd people started to "port" code, that Lesk > started to codify an IO library that could be used regardless of the local > OS. Look at the documentation in V6. You'll see some interesting like fd > = -1 and other artifacts that Mike had in so that earlier code would > recompile and relink. With typesetter C, Dennis forced a small rewrite of > your code to use libS.a. > > As I said, I lived this whole war. > Crumudgingly yours, > Clem > ᐧ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rich.salz at gmail.com Mon Mar 10 05:28:05 2025 From: rich.salz at gmail.com (Rich Salz) Date: Sun, 9 Mar 2025 15:28:05 -0400 Subject: [TUHS] BWK on possible alternative programming languages In-Reply-To: <20250309172122.2578521420@orac.inputplus.co.uk> References: <202503091249.529CnHXt105980@freefriends.org> <20250309172122.2578521420@orac.inputplus.co.uk> Message-ID: On Sun, Mar 9, 2025 at 1:21 PM Ralph Corderoy wrote: > Though AWK isn't named as BWK's earlier prototype, a --BWK, but after > AWK's authors, Aho, Weinberger, and Kernighan, who also wrote the book > ‘The AWK Programming Language’. Yes, I know that. I was commenting on the similarity between the initials of one, and the acronym of the three authors and that I never noticed that before. -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglas.mcilroy at dartmouth.edu Mon Mar 10 05:37:58 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Sun, 9 Mar 2025 15:37:58 -0400 Subject: [TUHS] BWK on possible alternative programming languages Message-ID: Adding to Brian's remarks. Both PL/I, which had been adopted by Multics, and BCPL/B were very familiar. PL/I , even gutted as it had been for Multics, was much too big to contemplate. BCPL's integration of subscripting and pointers was nice, as was its closeness to the machine. Typlessness was a drawback: how would one integrate floating point or characters? Another was the global vector, like Fortran COMMON. Algol W was known (to me, at least) only via its publication in CACM. I don't recall it having been considered. Because Algol W had more concepts than BCPL,was not as closely matched to machine-level coding, and (I believe) was equally lacking in separate-compilation facilities, I suspect it would not have made the cut. Doug -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.branden.robinson at gmail.com Mon Mar 10 05:41:52 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Sun, 9 Mar 2025 14:41:52 -0500 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> Message-ID: <20250309194152.tjgibbix55nhuntg@illithid> Hi Clem, Thanks for you detailed response. I appreciate getting perspectives from people who fought in the wars I'm too young to remember; I visit the memorials and try to imagine what things were like. At 2025-03-09T13:29:55-0400, Clem Cole wrote: > On Sun, Mar 9, 2025 at 9:18 AM G. Branden Robinson < > g.branden.robinson at gmail.com> wrote: > > At 2025-03-09T08:29:43-0400, Clem Cole wrote: > > > On Sat, Mar 8, 2025 at 10:47 PM Dan Cross wrote: > > > > Was any consideration for it made? > > > Only Ken can answer that. My guess is that Algol-W would have > > > failed for the same reasons Pascal ultimately failed. It really > > > was directed at students and lacked many of the tools C had. Many > > > of the issues exposed in Brian's treatise *"Why is Pascal not my > > > favorite programming language"* also apply to Algol-W. > > > > I think it's worth noting that, as I undertand it from having read > > the paper, old Pascal manuals, and ISO 7185 (and, a long time ago, > > experience with Borland Turbo Pascal), most or all of the defects > > Kernighan pointed out were addressed by vendored versions of Pascal > > and ultimately by its ISO standard form. I think even UCSD Pascal > > had extensions, though I don't recall what they were. > > > Branden — respectfully, if you download the standard > https://archive.org/details/iso-iec-7185-1990-Pascal I keep a copy on my tablet! Alongside two Free Pascal and the Turbo Pascal 7.0 manuals. I didn't have to fight in the trenches using Pascal for very long (and when I did, it was with Borland's generously expanded dialects, a few years apart). > or look at versions that have been converted to HTML such as > https://wiki.freepascal.org/Standard_Pascal, that statement is a tad > hollow. Sadly, that is the issue with Pascal. Okay. I'll take it as homework to double-check how feedble ISO Pascal is compared to the version critiqued by Kernighan. Hmm, I can already see that I might have confused ISO Standard Pascal (ISO 7185) with ISO Extended Pascal (ISO 10206). Both standards, but only one named "Standard". I apologize for my error. > The fundamental flaws of the core Pascal language remained as > described in Brian’s paper as: > > 1. Since the size of an array is part of its type, it is not possible > to write general-purpose routines, that is, to deal with arrays of > different sizes. In particular, string handling is very difficult. Yeah. That's a brutal flaw that every defender of Wirth Pascal has to concede at the risk of not being taken seriously. > 2. The lack of static variables, initialization, and a way to > communicate non-hierarchically combine to destroy the ‘‘locality’’ of > a program — variables require much more scope than they ought to. That's also a flaw, but C doesn't distinguish itself strongly here; `static` function linkage should have been the default. We should have had a `public` keyword (or synonym) instead. (Rust, to my great surprise, seems to have convinced people that `const` default is good too--hence `let mut`--and it is, for many reasons including concurrency. Even my beloved Ada was not so forward-looking.) > 3. The language's one-pass nature forces procedures and functions to > be presented in an unnatural order; the enforced separation of various > declarations scatters program components that logically belong > together. But that was true of C for a long time as well; as I understand it, one-pass C compilers were important enough that intermixed statements and declarations were allowed into the standard only in C99. That's a long time after CSTR #100. And for me, writing to minimize forward declarations _is_ the natural order. Just as in documentation, we want to avoid employing concepts before we've introduced them. (If your `main()` is either really short or strictly a sequence of calls to functions returning `void`, such that it reads like a procedural checklist, only then would I tolerate it at the top of the source file.) > 4. The lack of separate compilation impedes the development of large > programs and makes using external libraries impossible. This one's about as brutal as point 1. It requires concession. Modularity is critical. > 5. The order of logical expression evaluation cannot be controlled, > which leads to convoluted code and extraneous variables. Some of this blows back onto C. Yes, short-circuit logical evaluation operators are desirable, but apart from that the order of expression evaluation in C (a) is frequenly not specified so that compilers can generate fast code for the hardware; or (b) demands that the programmer remember something like 15 levels of operator precedence, which is savage cruelty in my opinion. Here I am recently correcting a dopey mistake of mine along those lines. https://git.savannah.gnu.org/cgit/groff.git/commit/?id=a0171c9dc12daa8e70b9178bfac63aa11745110b > 6. The case statement is emasculated because there is no default > clause. I'll freely grant this one, too, and go still farther and say that both Pascal _and_ C should have required Haskell-style exhaustive coverage of the controlling variable's domain. C's support for sum types was (and is) bad, and Pascal's started down the right road but then gave it up partway, to amplify Kernighan's point. > 7. The standard I/O is defective. There is no sensible provision for > dealing with files or program arguments as part of the standard language > and no extension mechanism. To my horror, when learning Java I discovered that it had managed to discard the basic competence of console/terminal I/O supported by C's stdio library. I don't dispute that C shone here. I think Korn and Vo have solid critiques of other aspects of stdio, and I find the irregularity of function names, argument order, and some aspects of behavior (such as whether an output function appends a newline or not) annoying, but I would not dispute that it's easier than in most languages I've dealt with to get a simple "conversational" terminal application stood up with C. Yes, you often want a "serious" UI in curses or a windowing system later (or none at all, relying on argv[] and the standard I/O streams to do everything as in the filter model), but that's no reason to fumble the ball on simple Teletype-style I/O. Yet too many languages did. Still, scanf() was a bad idea, or at least overapplied, and should not be taught to students as a tool for interactive I/O handling (as it was to me--did anyone else have this misfortune?). For that, a simple lexical analyzer is necessary because humans produce undisciplined input. It's never too early to teach a beginner how to write a finite state machine. > 8. The language lacks most of the tools needed for assembling large > programs, most notably file inclusion. Once you have separate compilation, this seems mostly like either (a) a linker problem or (b) a problem better solved by a real macro language like m4. The C preprocessor was merely adequate when it could have been brilliant. In case anyone feels a dark premonition of where that idea leads: yes, I'm saying the CSRC should have given us Autoconf instead of the C preprocessor. >:-) If they had, it would be far less hated. > 9. There is no escape. And that's why everybody added inline assembly and some added foreign function interfaces. > You >>are<< correct that many (not all) of the defects Brian and Dave > ran into when updating/rewriting the Software Tools book from FORTRAN > (Ratfor) to Pascal >>could<< have been fixed. The core issue with > Pascal is that if an implementation did add an extension,* every > implementation did it differently.* > > Now, the truth is that having different implementations from different > vendors was not unusual. The whole reason ASA developed FORTRAN-66, > which begat the many versions that followed [I believe FORTRAN-2018 is > the last verified version, but I'm told by Dr. FORTRAN [Steve Lionel] > that they are preparing an updated draft. BASIC had it in spades. In > fact, at one of the early 'Hatfield/McCoy" parties between my friends > at HP and Tektronix, we count 14 different flavors of "HP BASIC" and 6 > different flavors of "Tek Pascal." I guess standardization of Pascal started too late; but both C and C++ took a long time to get to their initial standards too. What do you think accounts for C and C++'s greater success, in the specific sense that vendor extensions didn't cripple the language's development? What force kept these languages better governed? > > Maybe this is what it means to have been "directed at > > students"--that easy wins in terms of language improvement were not > > implemented because covering them topically in a semester-long > > university course would not have been practical. If so, that may > > make the tragedy of Pascal greater, not smaller. > > > History. Wirth was teaching at Stanford when Algol-X was proposed. In > the late 1960s, the International Federation for Information > Processing (IFIP) held the International responsibility for the > definition of the programming language ALGOL 60, chartered IFIP > Working Group 2.1 to develop a replacement for the language which had > been identified, notably the lack of a standardized string subsystem. > This effort was dubbed ALGOL X. You can use an internet search to find > several of them, but one of the proposals was submitted by Niklaus > Wirth and Tony Hoare. When their proposal was not accepted, the > language ALGOL 68 would be developed under these auspices. I've read Lindsey's "A history of ALGOL 68" in HOPL 2 (1996). Intriguing stuff. I have no strong opinions about any of it, probably because no one's either forced me to write in ALGOL 68, nor forbidden me from doing so. I do admire its nearly universal loop structure: [ for index ] [ from first ] [ by increment ] [ to last ] [ while condition ] do statements od "Nearly" because it's not obvious to me how you'd simulate C's do-while, which is underrated and under-taught. And in case anyone's not aware of it, you can play with Algol 68 today. https://www.theregister.com/2025/01/07/algol_68_comes_to_gcc/ > The committee decided that the Wirth/Hoare proposal was an > insufficient advance over ALGOL 60, and the proposal was published as > a contribution to the development of ALGOL in CACM after making slight > modifications to the language. Wirth eventually decided to develop > his proposal and created an implementation targeting the IBM > System/360 at Stanford University. He called the new language Algol-W > [used in many universities, and I used it at CMU, and I have friends > that used it at Cornell and Princeton]. > > At the time, Wirth's original target was a tool to teach his students > at Stanford. It was not a "production quality" compiler in the sense > of IBM's FORTRAN-H, which was considered the "best" production > compiler at the time. Also, remember the whole assembler/compiled > code was hardly a settled debate. That was still a live issue when I was learning, but back then (on home 8-bit micros), you had to shell out money to get anything but the BASIC interpreter that was stored in ROM. We penurious kids, if pressed, would hand-assemble code to and poke it into memory to escape the shackles of the vendor's languid BASIC interpreter (usually under license from Microsoft). Made a lot of mistakes and froze the machine that way. No illegal instruction traps and no handler for such traps if we'd had 'em. Just push reset and hope you'd saved your work. "a.out?" https://www.nokia.com/bell-labs/about/dennis-m-ritchie/odd.html > Wulf's findings showed BLISS was as good as the best PDP-10/PDP-11 > programmers, which was still a few years in the future. As I've said > elsewhere, it took enough students trained in the ideas from the Green > Book a few years to get into the world to start to develop the > optimizers we now expect as the norm. I'd like to talk to some of that cohort about Ada. > > Though even that doesn't explain why Wirth didn't spec Pascal > > functions to admit polymorphism (perhaps optionally) in array > > length. That's an essential property for all sorts of work, even > > that undertaken by students. (Say you want them to write a function > > that multiplies matrices. You shouldn't have to specialize the > > function by matrix dimensions. The obvious pinch point for a Unix > > programmer is massive inconvenience if you want to handle > > variable-length strings.) > > > Again, it was not considered an issue for what they were trying to > solve. Read the IFPS meeting notes. That shocks me, but okay. I realize I'm probably abusing the term "polymorphism" here, since that's usually applied to distinguishable data types. It bluescreens my brain to think that "iterate along this vector of identically typed elements until reaching a known terminating value" was regarded as a technique with any hint of the exotic. What am I missing? > > Perusing its manual some months ago, it occurred to me that if > > someone had had the foresight to implement the language core of > > Turbo Pascal 7.0, ignoring the libraries/"modules" entirely, and > > omitting the crap necessary to work with the x86 segmented memory > > model, I don't think Kernighan would have had much, if anything, to > > complain about. And, in my opinion, we'd be living in a better > > timeline. > > > Turbo Pascal was a flavor of Pascal that came much later. Yes, TP 1.0 was late 1983; Wirth's spec paper was 1973, UCSD Pascal 1977, and I can't find a date for UCB Pascal, but... > At the time, UCB Pascal ...Thompson had a hand in writing that one, didn't he? During his sabbatical that spawned the CSRG? If so, that was 1975. > and UCSD Pascal were the two popular implementations for students, and > commercial folks used Whitesmiths and, later, OMSI. The first three > were the compilers Brian and PJ used (I believe all were running a V7 > Unix, but I'm not positive about the target OS). > > FWIW: today's popular Pascal is Free Pascal (freepascal.org), and it > accepts all of the syntax of Turbo, Object Pascal, and Delphi as > input. But it was too late. It's still a highly active project, and one I keep an eye on. On the one hand, ports to RISC-V and WASM, and on the other, fixes to its m68k backend and TP-style text-mode IDE have all seen commits in the past month. They also seem to take their documentation seriously, a significant sign of health in my view. I have no expectation that it will displace C, but its developers have my admiration for pushing forward. Maybe that's because, in maintaining a typesetting system that _isn't_ TeX, I recognize potentially kindred spirits. > > But I guess Pascal just didn't get good enough, soon enough. > > > No, it was not agreed upon. Everyone developing a Pascal > implementation was pushing their flavor (sound familiar). Turbo "won" > when they targeted DOS, which, at the time, was being ignored by the > traditional "professional" programmers (DOS was too much of a "toy" to > be serious. - Well, the professionals weren't wrong. MS-DOS was a crap OS on a crap ISA. They were both market placeholders. But we don't work, or live, in a meritocracy. As ever, the surest route to the head of the line is to be deposited there at birth by your ancestor. > Minicomputers were the bread and butter). When PharLap created an OS > extension that allowed for a real 32-bit C for the 386, UNIX and C had > already taken the mantel, and Pascal was losing favor in the DOS > world. > > > And I wonder how big such a compiler would be, even if coded > > competently. > > > Look at OMSI, which primarily targeted the PDP-11. It has many of > these extensions [it basically started as one of the Tek Pascal's, > BTW. Jon and I worked with some folks who developed it.] Oof, for RT-11. I've booted RSX-11--once--on my PiDP-11. It was a jarring experience. Booting 2.11BSD was headily nostalgic, and took me _way_ back. I haven't seen a boot sequence like that in a long time. > > Chronology bedevils us in another way. A _lot_ of C language > > features its advocates are proud of were not present in 1973 or even > > when Kernighan wrote CSTR #100 around 1980. > > Please be careful here. The point is that in 1980, Pascal and C, as > described by Brain, C did not have the issues I listed above, I can't completely agree with you here because as I understand the critique, C still has _some_ of them _today_. Often not as badly as in the past, and almost certainly not as bad as the Pascals of the day. > but the popular Pascal implementations being used in the wild did. > > > It is an error to substitute the ANSI C of 1989 for the C that > > actually existed at the time when asking who'd win in a fight > > between C and Pascal taking place in, say, 1977. > > I don't since I lived it. Yes, but I'm not always aiming at you personally. :) > > https://gunkies.org/wiki/Typesetter_C > > Need to work with Noel to update that page -- that name was because it > came with the original ditroff - which used libS.a to replace Lesk's > portable I/O library. I was (am) a bit confused because like you I thought "Typesetter C" was something else, something later, that was described in the 1-page article in the V7 manual "Recent Changes to C", which introduced (per its headings) "Structure assignment" and [an] "Enumeration type". _But_, that's dated November 15, 1978, which seems a tad early for Kernighan's device-independent troff work. "...in the spring of 1979, I set about to modify TROFF so that it would run henceforth without change on a variety of typesetters." (CSTR #97) London and Reiser's 1979 paper describing their group's port of Unix to the VAX-11 (I gather from other sources that the CSRC was upset with DEC and refused to do it) identifies barriers that the C of the day erected to portable programming. Most of them were addressed by ANSI C. I had thought that device-independent troff directly drove some changes to the C language (not just support libraries), but maybe I'm wrong. I have no idea how the Ritchie compiler or PCC evolved after Seventh Edition (1979). Apart from a couple of books (Hancock & Krieger, _C Primer Plus_, and _Notes on the Draft C Standard_, Tom Plum), I have no insight into 1980s C. The main thing I know about it is that if an implementation targeted x86, it added `near` and `far` keywords to accommodate segmented memory and/or DOS-style object file formats. I'd have learned C (or tried to) a lot younger if I could have afforded a compiler for it. Instead I had to wait for GCC. And a real computer. > It was less an issue of new compiler language features and more the > requirement for a new I/O library. Remember, C and BLISS, as examples > are defined without an I/O library in their origin stories. They both > use the local OS's IO interface. C does not formally get an I/O > library until Typesetter C and White Book. Simply, until C moved to > the Honeywell and 360 nd people started to "port" code, that Lesk > started to codify an IO library that could be used regardless of the > local OS. Look at the documentation in V6. You'll see some > interesting like fd = -1 and other artifacts that Mike had in so that > earlier code would recompile and relink. With typesetter C, Dennis > forced a small rewrite of your code to use libS.a. Well, you _could_ bypass it and use read() and write() directly. Say, in cat(1), the way Pike intended. ;-) > As I said, I lived this whole war. I appreciate hearing it and I don't mind being corrected (or challenged on my questionable taste). I'm a contrarian about C because triumphalism and other forms of sore winning are not good looks for engineers. Every time I blunder into Pascal-style pseudocode, I experience a pleasant jolt of recognition. Warmest regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From tuhs at tuhs.org Mon Mar 10 05:57:24 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sun, 9 Mar 2025 12:57:24 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309194152.tjgibbix55nhuntg@illithid> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: <17CBDD1B-20EA-4993-AD44-A2FF6AF54416@iitbombay.org> On Mar 9, 2025, at 12:41 PM, G. Branden Robinson wrote: > > And in case anyone's not aware of it, you can play with Algol 68 today. > > https://www.theregister.com/2025/01/07/algol_68_comes_to_gcc/ a68 has been available for quite a while on Unix (FreeBSD): See https://www.freshports.org/lang/algol68g The Algol 68 Genie project preserves Algol 68 out of educational as well as scientific-historical interest, by making a modern, well-featured hybrid compiler-interpreter written from scratch, which ranks among the most complete Algol 68 implementations. It features arbitrary precision arithmetic, complex numbers, parallel processing, partial parameterization, and formatted transput, as well as support for curses, regular expressions, and sound. From johnl at taugh.com Mon Mar 10 06:13:44 2025 From: johnl at taugh.com (John Levine) Date: 9 Mar 2025 16:13:44 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: <20250309201345.02C6DBF5512D@ary.qy> It appears that Dan Cross said: >My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was >already mentioned. I don't think PL/1 (or PL/I) could have fit on >those machines. ... There were a lot of PL/I subsets or variants used for system programming. Intel had PL/M, used to write CP/M. IBM had PL/S, used to write some parts of OS/360, and reimplemented at RAND in the early 1970s. XPL was writeen at Stanford in the late 1960s, intended for writing compilers with a small one-pass compiler written in itself. PL/360 was sort of PL/I-ish although it was really an IBM 360 assembler with Algol like syntax, used to write Algol W. One of these could have been a reasonable basis for a system language, but I don't think the result would have been any better than C. R's, John From luther.johnson at makerlisp.com Mon Mar 10 06:35:06 2025 From: luther.johnson at makerlisp.com (Luther Johnson) Date: Sun, 9 Mar 2025 13:35:06 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309201345.02C6DBF5512D@ary.qy> References: <20250309201345.02C6DBF5512D@ary.qy> Message-ID: <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> I believe CP/M was written entirely in 8080 assembly language. PL/M was a PL/1 subset, I think Gary Kildall was the main programmer behind that, and I'm sure there was a version for CP/M, but I doubt CP/M was written in it, just due to what I've seen of it. I have a port of CP/M for a machine I've made and sold, where another programmer and I did the porting work, and from what I've seen of early DOS and how it was in many ways modeled after CP/M, it doesn't seem like CP/M was written in anything other than 8080 assembly. However I've only seen the BIOS (not IBM PC BIOS, but the original coining of the term for CP/M, standing for "basic input/output system"), so on the other side, inside of CP/M, I guess it might be anything, but it seems like it is most likely 8080 assembly language too. On 03/09/2025 01:13 PM, John Levine wrote: > It appears that Dan Cross said: >> My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was >> already mentioned. I don't think PL/1 (or PL/I) could have fit on >> those machines. ... > There were a lot of PL/I subsets or variants used for system > programming. Intel had PL/M, used to write CP/M. IBM had PL/S, used to > write some parts of OS/360, and reimplemented at RAND in the early > 1970s. XPL was writeen at Stanford in the late 1960s, intended for > writing compilers with a small one-pass compiler written in itself. > > PL/360 was sort of PL/I-ish although it was really an IBM 360 assembler with > Algol like syntax, used to write Algol W. > > One of these could have been a reasonable basis for a system language, but I > don't think the result would have been any better than C. > > R's, > John > From clemc at ccc.com Mon Mar 10 06:58:57 2025 From: clemc at ccc.com (Clem Cole) Date: Sun, 9 Mar 2025 16:58:57 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> References: <20250309201345.02C6DBF5512D@ary.qy> <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> Message-ID: On Sun, Mar 9, 2025 at 4:35 PM Luther Johnson wrote: > I believe CP/M was written entirely in 8080 assembly language. Not initially -- see below. > PL/M wasa PL/1 subset, I think Gary Kildall was the main programmer behind > that, > Gary was a language person not an OS person. PL/M was developed by him under contract with Intel for the original Intel 8080 development system. > and I'm sure there was a version for CP/M, but I doubt CP/M was written > in it, just due to what I've seen of it. >From the readme: https://github.com/brouhaha/cpm22 Introduced in 1974, CP/M by Digital Research was one of the first microcomputer operating systems that was not tied to a single computer vendor. It could be adapted to run on almost any 8080 or Z80 microcomputer that had at least 16KB of RAM starting at address 0000h. Originally much of CP/M was written in the PL/M programming language. With the introduction of CP/M 2.0, the command processor (CCP) and kernel (BDOS) were rewritten in 8080 assembly language. > I have a port of CP/M for a > machine I've made and sold, where another programmer and I did the > porting work, and from what I've seen of early DOS and how it was in > many ways modeled after CP/M, it doesn't seem like CP/M was written in > anything other than 8080 assembly. However I've only seen the BIOS (not > IBM PC BIOS, but the original coining of the term for CP/M, standing for > "basic input/output system"), Again, the term came from the Intel 8008 development system. > so on the other side, inside of CP/M, I > guess it might be anything, but it seems like it is most likely 8080 > assembly language too. > As a compiler guy, Kidall was once quoted as the reason who originally wrote CP/M to sell more copies of his compiler. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From paul at mcjones.org Mon Mar 10 07:01:03 2025 From: paul at mcjones.org (Paul McJones) Date: Sun, 9 Mar 2025 14:01:03 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <174154718981.615624.15831772136951719489@minnie.tuhs.org> References: <174154718981.615624.15831772136951719489@minnie.tuhs.org> Message-ID: Ken, Was smalgol also known as BC Algol, as described here: https://www.softwarepreservation.org/projects/ALGOL/algol60impl/#BC_ALGOL > On Mar 9, 2025, at 12:06 PM, Ken Thompson > > wrote: > > how about smalgol? > > it was an algol-like language with just int and float types. > i dont know its history, but it came out of berkeley near > when Niklaus Wirth was there. it compiled for the ibm 7094 > in normal batch processing fashion. i converted it to a jit > into memory in order to skip the loading phase. i used > it for a lot of my fun-work. (1965-66) > > mainframe time, then, was a big factor in the computing process. > smalgol could compile, load, and run in about 1 cpu-second. > > smalgol was all ibm-cards, but it was on my mind through > the bcpl to b to nb phases. i would use the modern word > "influencer.” Paul McJones -------------- next part -------------- An HTML attachment was scrubbed... URL: From luther.johnson at makerlisp.com Mon Mar 10 07:12:01 2025 From: luther.johnson at makerlisp.com (Luther Johnson) Date: Sun, 9 Mar 2025 14:12:01 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309201345.02C6DBF5512D@ary.qy> <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> Message-ID: I see, thank you for the history and clarification. The CP/M I have worked with is CP/M 2.2. But as I said, the OS inside could have been written in anything, and now I know in earlier versions, it was something else, thank you. On 03/09/2025 01:58 PM, Clem Cole wrote: > > > On Sun, Mar 9, 2025 at 4:35 PM Luther Johnson > > > wrote: > > I believe CP/M was written entirely in 8080 assembly language. > > Not initially -- see below. > > PL/M wasa PL/1 subset, I think Gary Kildall was the main > programmer behind that, > > Gary was a language person not an OS person. PL/M was developed by > him under contract with Intel for the original Intel 8080 development > system. > > and I'm sure there was a version for CP/M, but I doubt CP/M was > written > in it, just due to what I've seen of it. > > From the readme: https://github.com/brouhaha/cpm22 > > Introduced in 1974, CP/M by Digital Research was one of the first > microcomputer operating systems that was not tied to a single > computer vendor. It could be adapted to run on almost any 8080 or > Z80 microcomputer that had at least 16KB of RAM starting at > address 0000h. > > > Originally much of CP/M was written in the PL/M programming > language. With the introduction of CP/M 2.0, the command processor > (CCP) and kernel (BDOS) were rewritten in 8080 assembly language. > > I have a port of CP/M for a > machine I've made and sold, where another programmer and I did the > porting work, and from what I've seen of early DOS and how it was in > many ways modeled after CP/M, it doesn't seem like CP/M was written in > anything other than 8080 assembly. However I've only seen the BIOS > (not > IBM PC BIOS, but the original coining of the term for CP/M, > standing for > "basic input/output system"), > > Again, the term came from the Intel 8008 development system. > > so on the other side, inside of CP/M, I > guess it might be anything, but it seems like it is most likely 8080 > assembly language too. > > > As a compiler guy, Kidall was once quoted as the reason who originally > wrote CP/M to sell more copies of his compiler. > ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From dave at horsfall.org Mon Mar 10 08:47:33 2025 From: dave at horsfall.org (Dave Horsfall) Date: Mon, 10 Mar 2025 09:47:33 +1100 (AEDT) Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309194152.tjgibbix55nhuntg@illithid> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: On Sun, 9 Mar 2025, G. Branden Robinson wrote: > [...] I appreciate getting perspectives from people who fought in the > wars I'm too young to remember; I visit the memorials and try to imagine > what things were like. I really enjoyed using ALGOLW in my student days, but not so much Pascal (especially the one that ran under CDC KRONOS, as its read-ahead meant that you were presented with a "?" prompt before any output was seen). -- Dave From imp at bsdimp.com Mon Mar 10 08:57:00 2025 From: imp at bsdimp.com (Warner Losh) Date: Sun, 9 Mar 2025 16:57:00 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309201345.02C6DBF5512D@ary.qy> <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> Message-ID: On Sun, Mar 9, 2025 at 2:59 PM Clem Cole wrote: > > > On Sun, Mar 9, 2025 at 4:35 PM Luther Johnson < > luther.johnson at makerlisp.com> wrote: > >> I believe CP/M was written entirely in 8080 assembly language. > > Not initially -- see below. > >> PL/M wasa PL/1 subset, I think Gary Kildall was the main programmer >> behind that, >> > Gary was a language person not an OS person. PL/M was developed by him > under contract with Intel for the original Intel 8080 development system. > >> and I'm sure there was a version for CP/M, but I doubt CP/M was written >> in it, just due to what I've seen of it. > > From the readme: https://github.com/brouhaha/cpm22 > > Introduced in 1974, CP/M by Digital Research was one of the first > microcomputer operating systems that was not tied to a single computer > vendor. It could be adapted to run on almost any 8080 or Z80 microcomputer > that had at least 16KB of RAM starting at address 0000h. > > > Originally much of CP/M was written in the PL/M programming language. With > the introduction of CP/M 2.0, the command processor (CCP) and kernel (BDOS) > were rewritten in 8080 assembly language. > > > >> I have a port of CP/M for a >> machine I've made and sold, where another programmer and I did the >> porting work, and from what I've seen of early DOS and how it was in >> many ways modeled after CP/M, it doesn't seem like CP/M was written in >> anything other than 8080 assembly. However I've only seen the BIOS (not >> IBM PC BIOS, but the original coining of the term for CP/M, standing for >> "basic input/output system"), > > Again, the term came from the Intel 8008 development system. > > > >> so on the other side, inside of CP/M, I >> guess it might be anything, but it seems like it is most likely 8080 >> assembly language too. >> > > As a compiler guy, Kidall was once quoted as the reason who originally > wrote CP/M to sell more copies of his compiler. > Isn't that what Unix did for C Warner > ᐧ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Mon Mar 10 08:58:14 2025 From: clemc at ccc.com (Clem Cole) Date: Sun, 9 Mar 2025 18:58:14 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309194152.tjgibbix55nhuntg@illithid> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: On Sun, Mar 9, 2025 at 3:42 PM G. Branden Robinson < g.branden.robinson at gmail.com> wrote: > Hi Clem, > > Thanks for you detailed response. Most welcome. > I appreciate getting perspectives > from people who fought in the wars I'm too young to remember; I visit > the memorials and try to imagine what things were like. > I think a huge issue is your introduction to computing was on a PC, not on a mainframe or even a mini. IMHO the #1 biggest issue with the PC were so little learnings from either systems was used and so much was re-invented. As some on once said, instead of standing on the shoulders of the giants from before, computer scientists seems to step each other other toes. > > Hmm, I can already see that I might have confused ISO Standard Pascal > (ISO 7185) with ISO Extended Pascal (ISO 10206). Both standards, but > only one named "Standard". I apologize for my error. > No worries, by 1990 when ISO 10206 came about, Pascal had already morphed even by Wirth himself. I liked Modula-II (and so called Mod-III the DEC SRC implementation). One of my favorite early workstations was Wirth's Lilith a Mod-22 based system that was an Alto clone like the Perq (a.k.a the PascAlto). > > > That's also a flaw, but C doesn't distinguish itself strongly here; > `static` function linkage should have been the default. You can argue both ways. Asmblers of the day worked like C did. > > But that was true of C for a long time as well; as I understand it, > Hmm look at the sources or at least at Dennis' Tour document. Ken or Steve Bourne can correct me he as I started with Fifth Edition and pretty sure Dennis' compiler was multipass. Some passed were separate programs due to the address space limits of the PDP-11. But by 1978 and Typesetter C, much less 80/81 when Brian and PJ did SW Tools in Pascal, Brian's statement hold. > > And for me, writing to minimize forward declarations _is_ the natural > order. Hey, I was taught that way also. I knew FORTRAN, Algol-W, SAIL, BLISS, as well as Pascal (much less a few assembler languages) before I learned C and the PDP-11. The White book did not exist, BTW. I started learning before Lesk's IO library - although we got V6, I have few memories of anything interesting by me being written without using Lesk's library or later libS.a from Typesetter C. I understand Brian's complaint, and coming from Ratfor/FORTRAN, Pascal's rigor certainly would have been awkward, but I liked having the flexibility that C provided. Frankly, I hate more modern C code when the first thing you see is a lot of functions, and then main() is declared much later. But then again, I like to keep main() and maybe one or two primary functions in the program.c file and all my subroutines in either subr.c or like, then in program.h declare everything. > > > > 5. The order of logical expression evaluation cannot be controlled, > > which leads to convoluted code and extraneous variables. > > Some of this blows back onto C. Yes, short-circuit logical evaluation > operators are desirable, Absolutely. > but apart from that the order of expression > evaluation in C (a) is frequenly not specified so that compilers can > generate fast code for the hardware; or (b) demands that the programmer > remember something like 15 levels of operator precedence, which is > savage cruelty in my opinion. > That's more about the implementation of a coding style. Post students of Wulf's Green Book, the compiler should not care. It should generate optimal code. Anyway, I frankly prefer to see explicit parens, so the precedence is clear — it's easier to read and less error-prone for the maintainer. > > Still, scanf() was a bad idea, or at least overapplied, Remember printf() and scanf() come from FORTRAN in heritage. The idea was that the format statements worked in both places. I'll not argue that scanf() is a mess; I have generally avoided it. > and should not > be taught to students as a tool for interactive I/O handling (as it was > to me--did anyone else have this misfortune?). I seriously hope you were not given the C Programming Language as a teaching language — shame on your instructors. We don't put new pilots in an F15. Even in the military, they start on straightforward fixed-wing aircraft similar to the Piper Cub or Cessna 150 [just as my father did when he was a fighter instructor for the Army in the 1940s, BTW]. Use the tools for what they were designed for. I still think Pascal is the best teaching language around, and Clancy and Cooper's book "Oh Pascal" is the best way to introduce programmers to systems thinking. But give them professional tools as they start to do harder things, and their skills will improve. It's funny as she has since graduated college and works for a local SW firm, but a young woman I mentored a few years ago; I taught her using "Oh Pascal" while her HS was trying to use C++ (and she was very confused by it - the teacher was poor and the book wasn't great either — Dietel and Dietel). She later went to WPI, and she thanked me for "forcing her" to learn to do the simple stuff and understand it. She saw Java in college courses, and things made much more sense to her. > For that, a simple > lexical analyzer is necessary because humans produce undisciplined > input. It's never too early to teach a beginner how to write a finite > state machine. > I agree — everyone should write for their first compiler in Pascal for a simple language and no cheating using YACC. You need to write the whole thing if you want to understand how parsing really works. > > I guess standardization of Pascal started too late; but both C There was a standard. Everyone had the sources to Dennis and later Steve's compilers. 😉 > and C++ took a long time to get to their initial standards too. What do > you > think accounts for C and C++'s greater success, in the specific sense > that vendor extensions didn't cripple the language's development? What > force kept these languages better governed? > In C's case, the committees were controlled as long as Dennis was there. Two of Dennis's most famous quotes are: *"When I read commentary about suggestions for where C should go, I often think back and give thanks that it wasn't developed under the advice of a worldwide crowd."* and *“C is quirky, flawed, and an enormous success.”* I find where the C standard folks falling into the C++ trap and letting people pee it. CC++ is a disaster — primarily because everyone added their favorite feature. That's while I like Go for applications work, and if I were to do OS (near the HW development) today and not use C, I would have to consider Rust. > > > I've read Lindsey's "A history of ALGOL 68" in HOPL 2 (1996). > Intriguing stuff. I have no strong opinions about any of it, probably > because no one's either forced me to write in ALGOL 68, nor forbidden me > from doing so. I do admire its nearly universal loop structure: > I'll leave it to srb to reply to that. CMU had a budding A68 team led by Peter Hibbard, but I never used it. I had Peter for compilers, so we read a number of the papers at the time, but after I graduated, work that led to Ada had started. I liked what I saw with Red; it follows 'Tartan,' which Wulf and his students, like Hilfiger, put together. as a "Hail Mary" when it started to be clear Green was going to "win." When I was a grad student, and Paul had finished his PhD and started teaching at UCB, I took his comparative language seminar. I learned to hate Ada (*a.k.a.* "Green") a bit less. But that's a different email thread with fewer Unix orientations, so it belongs elsewhere. > > > That was still a live issue when I was learning, but back then (on home > 8-bit micros), you had to shell out money to get anything but the BASIC > interpreter that was stored in ROM. > I understand. The more significant issue is that "production quality" compilers did not target those systems (yet). This came up a while back on Quora: https://www.quora.com/When-did-C-compilers-start-generating-optimized-code-that-runs-faster-than-average-assembly-programmers-code/answer/Clem-Cole I'd like to talk to some of that cohort about Ada. > Sure, move it, COFF. > > That shocks me, but okay ... What am I missing? > Simple: a different set of problems at the time. Read my answer: https://www.quora.com/How-did-we-make-the-first-computer-operating-system-if-we-didnt-have-computers-to-make-them-on/answer/Clem-Cole > > > > ...Thompson had a hand in writing that one, didn't he? During his > sabbatical that spawned the CSRG? If so, that was 1975. > Pascal escapes UCB with what we now call "1BSD," the original BSD tape that Joy wrote either 77 or 78. It's written in a flavor of pre-typesetter C - which was a UCB modified V6 dmr compiler (and eyacc UCB' extended Yacc IIRC) > > I have no expectation that it will displace C, but its developers have > my admiration for pushing forward. Maybe that's because, in maintaining > a typesetting system that _isn't_ TeX, I recognize potentially kindred > spirits. > I've used Free Pascal as a teaching tool for the last few years. They did a nice job. > > Well, the professionals weren't wrong. MS-DOS was a crap OS on a crap > ISA. They were both market placeholders. But we don't work, or live, > in a meritocracy. As ever, the surest route to the head of the line is > to be deposited there at birth by your ancestor. > Cole's Law: "*Simple Economics always beats Sophisticated Design*" The PCs were cheap, and >>eventually<< people got them to do the things they needed. This is classic Christenson 'Disruptive Technology." The "lessor" technology beat the "better" one by creating a new market with different set of customer and can build faster and over time the features of the 'lessor' technology will catch up because its on a faster growth curve. > Oof, for RT-11. OMSI Pascal ran on RT/RSX/RSTS and Unix. But they sold the most copies for RT and RSX. > I've booted RSX-11--once--on my PiDP-11. It was a jarring experience. Yep. > Booting 2.11BSD was headily nostalgic, and took me > _way_ back. I haven't seen a boot sequence like that in a long time. > Did you try V5, V6 or V7? > > > I can't completely agree with you here because as I understand the > critique, C still has _some_ of them _today_. Often not as badly as in > the past, and almost certainly not as bad as the Pascals of the day. > See Dennis' quotes from above. > > I had thought that device-independent troff directly drove some changes > to the C language (not just support libraries), but maybe I'm wrong. It likely did. Steve may be able to add color here. > I have no idea how the Ritchie compiler or PCC evolved after Seventh > Edition (1979). PCC took the mantel, and eventually, PCC2 took that -- look in SVR4 sources for the latter. > The main thing I know about it is that if an > implementation targeted x86, it added `near` and `far` keywords to > accommodate segmented memory and/or DOS-style object file formats. I think Lattice C did that first, and it was picked up by the DOS crew. Most of us thought it was an abomination, but we had switched to 32-bits by then and Vaxen and 68000s. > Well, you _could_ bypass it and use read() and write() directly. Say, > in cat(1), the way Pike intended. ;-) > And people did since that was the >>UNIX<< IO system, but it was not the Honeywell or MVS system (or ITS for the Synder PDP-10 compiler). As I said, please remember that systems programming languages were like assemblers; the local OS supplied the I/O library. Lesk codified a scheme that could be made to work on multiple target OSes. Dennis rethought and reimplemented a similar idea with fewer issues: libS.a. With the White Book, libS.a was formalized, and only then did C have an IO library. In fact, when Lattice did their first DOS compiler, they provided an IO scheme for DOS and only sort of kinda did Standard I/O. Best wishes. Clem -------------- next part -------------- An HTML attachment was scrubbed... URL: From lm at mcvoy.com Mon Mar 10 09:12:22 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sun, 9 Mar 2025 16:12:22 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: <20250309231222.GN24601@mcvoy.com> > Anyway, I frankly prefer to see explicit parens, so the precedence > is clear ??? it's easier to read and less error-prone for the maintainer. This, a thousand times, this. I taught my guys "think of coding as write once, read many. I could care less how much more work it is, write code such that it is the easiest to understand". From usotsuki at buric.co Mon Mar 10 09:18:50 2025 From: usotsuki at buric.co (Steve Nickolas) Date: Sun, 9 Mar 2025 19:18:50 -0400 (EDT) Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309231222.GN24601@mcvoy.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> Message-ID: On Sun, 9 Mar 2025, Larry McVoy wrote: >> Anyway, I frankly prefer to see explicit parens, so the precedence >> is clear ??? it's easier to read and less error-prone for the maintainer. > > This, a thousand times, this. I taught my guys "think of coding as write > once, read many. I could care less how much more work it is, write code > such that it is the easiest to understand". > I assume compiler braindeath and always code around it. An extra pair of parentheses never hurt. -uso. From stewart at serissa.com Mon Mar 10 09:39:46 2025 From: stewart at serissa.com (Lawrence Stewart) Date: Sun, 9 Mar 2025 19:39:46 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250309231222.GN24601@mcvoy.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> Message-ID: <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> The C operator precedence table has 15 precedence levels, from “++" down to “," (see https://en.cppreference.com/w/c/language/operator_precedence) This is nuts. I don’t remember them and I wouldn’t trust an engineer who claimed to. Around 2005, when I was doing some chip verification, I found a hard to notice operator precedence bug (in VHDL, but it is the same issue) that would have cost us a half-million dollar mask spin. If there is more than one operator, I use parens (I do write a[x] + b[x], that one I know.) Our K-12 system isn’t doing us any favors when they think PEMDAS is “mathematics”. -L PS I’ve been a little angry about this since my 6th grader got marked down for using “extra” parentheses in class. https://larry.stewart.org/2010/12/05/order-of-operations-evil-and-pernicious/ > On Mar 9, 2025, at 19:12, Larry McVoy wrote: > >> Anyway, I frankly prefer to see explicit parens, so the precedence >> is clear ??? it's easier to read and less error-prone for the maintainer. > > This, a thousand times, this. I taught my guys "think of coding as write > once, read many. I could care less how much more work it is, write code > such that it is the easiest to understand". -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: favicon.ico Type: image/vnd.microsoft.icon Size: 4119 bytes Desc: not available URL: From kenbob at gmail.com Mon Mar 10 10:38:29 2025 From: kenbob at gmail.com (Ken Thompson) Date: Sun, 9 Mar 2025 17:38:29 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <174154718981.615624.15831772136951719489@minnie.tuhs.org> Message-ID: > Was smalgol also known as BC Algol, as described here: no, bc algol was a phd thesis of ralph love. i was an RA for ralph. he was a phd student for wirth. it was supposed to be the first (only?) complete implementation of algol 60. (thats an interesting statement: think of integer labels, and recursive call by name.) i coded most of the compile-to-byte-code pass of the compiler. ralph designed it. later i wrote a byte-code interpreter on a pdp-8s, which showed the memory state graphically. On Sun, Mar 9, 2025 at 2:01 PM Paul McJones wrote: > Ken, > > Was smalgol also known as BC Algol, as described here: > > https://www.softwarepreservation.org/projects/ALGOL/algol60impl/#BC_ALGOL > > On Mar 9, 2025, at 12:06 PM, Ken Thompson > > wrote: > > how about smalgol? > > it was an algol-like language with just int and float types. > i dont know its history, but it came out of berkeley near > when Niklaus Wirth was there. it compiled for the ibm 7094 > in normal batch processing fashion. i converted it to a jit > into memory in order to skip the loading phase. i used > it for a lot of my fun-work. (1965-66) > > mainframe time, then, was a big factor in the computing process. > smalgol could compile, load, and run in about 1 cpu-second. > > smalgol was all ibm-cards, but it was on my mind through > the bcpl to b to nb phases. i would use the modern word > "influencer.” > > > > Paul McJones > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stuff at riddermarkfarm.ca Mon Mar 10 10:55:20 2025 From: stuff at riddermarkfarm.ca (Stuff Received) Date: Sun, 9 Mar 2025 20:55:20 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> Message-ID: On 2025-03-09 19:39, Lawrence Stewart wrote: > The C operator precedence table has 15 precedence levels, from “++" down > to “," > (see https://en.cppreference.com/w/c/language/operator_precedence > ) > > This is nuts.  I don’t remember them and I wouldn’t trust an engineer > who claimed to. > > Around 2005, when I was doing some chip verification, I found a hard to > notice operator precedence bug (in VHDL, but it is the same issue) that > would have cost us a half-million dollar mask spin. > > If there is more than one operator, I use parens (I do write a[x] + > b[x], that one I know.) > > Our K-12 system isn’t doing us any favors when they think PEMDAS is > “mathematics”. > > -L > > PS I’ve been a little angry about this since my 6th grader got marked > down for using “extra” parentheses in class. Parentheses around expressions were required in our coding style. S. From robpike at gmail.com Mon Mar 10 11:19:44 2025 From: robpike at gmail.com (Rob Pike) Date: Mon, 10 Mar 2025 12:19:44 +1100 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> Message-ID: I have a shell script that helps: % cat bin/ops #!/usr/local/plan9/bin/rc cat < . left 2 ! ~ ++ -- - (type) * & sizeof right 3 * / % left 4 + - left 5 >> << left 6 < <= > >= left 7 == != left 8 & left 9 ^ left 10 | left 11 && left 12 || left 13 ?: right 14 = op= right 15 , left ! % I used to use it regularly. Later I did the obvious followup: % cat bin/goops #!/bin/sh cat <> & &^ 5 + - | ^ 4 == != < <= > >= 3 <- 2 && 1 || ! % The difference besides the obvious is that it turned out I never need goops. (Also gofmt uses spaces cleverly to show grouping.) -rob -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Mon Mar 10 11:31:19 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Sun, 9 Mar 2025 18:31:19 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: <339C95CB-A318-433F-A0C3-78E756C7FB14@iitbombay.org> Criticisms of C should not be viewed as "casting aspersions". We can't change the past but we can certainly assess the effect of past decisions. [Not sure that would change anything though, but we are just chitchatting!] IMHO with C what we have is a tragedy of scale. A pleasant to use language that worked/works well for a small group of highly competent people with skills beyond programming doesn't work so well for much larger groups of people with varying level of competency & skill. Just as the informality and few implicit rules of a village doesn't scale to large cities. I believe Apollo's Domain/OS was implemented in Pascal. Tunis that Greg Woods alluded to was implemented in Concurrent Euclid. KeyKOS (a capability based system) was implemented in PL/I. Per Brinch Hansen implemented a couple of OSes in Concurrent Pascal. And so on. I think the original Mac OS was in Pascal? Of course, most of these efforts came later or were on much larger systems. The beauty of Unix is in its relatively few but well defined components & their composability. I wouldn't know if the design of C had something to do with that or if it was just the pure genius of Thompson, Ritchie and their cohorts at Bell Labs. > On Mar 8, 2025, at 7:46 PM, Dan Cross wrote: > > As I mentioned in the discussion about C, it's easy to look back with > a modern perspective and cast aspersions on C. But this got me > thinking, what would possible alternatives have been? In the context > of the very late 1960s heading into the early 70s, and given the > constraints of the PDP-7 and early PDP-11s, what languages would one > consider for implementing a system like early Unix? Dennis's history > paper mentioned a very short-lived effort at Fortran, and I asked > about that a few years ago, but no one really remembered much about > it; I gather this was an experiment that lasted a few days or weeks > and was quickly abandoned. But what else? > > My short list included PL/1, Algol/W, Fortran, and Pascal. Fortran was > already mentioned. I don't think PL/1 (or PL/I) could have fit on > those machines. Pascal was really targeted towards teaching and would > have required pretty extensive work to be usable. The big question > mark in my mind is Algol/W; how well known was it at the time? Was any > consideration for it made? > > Obviously, the decision to go with BCPL (as the basis for B, which > beget C) was made and the rest is history. But I'm really curious > about how, in the research culture at the time, information about new > programming languages made its way through the community. > > - Dan C. From johnl at taugh.com Mon Mar 10 11:51:50 2025 From: johnl at taugh.com (John Levine) Date: 9 Mar 2025 21:51:50 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> References: <20250309201345.02C6DBF5512D@ary.qy> <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> Message-ID: <20250310015150.C1294BF5E87E@ary.qy> It appears that Luther Johnson said: >I believe CP/M was written entirely in 8080 assembly language. PL/M was >a PL/1 subset, I think Gary Kildall was the main programmer behind that, >and I'm sure there was a version for CP/M, but I doubt CP/M was written >in it, ... See this Wikipedia article on PL/M, which summarizes the history that Kildall wrote up in a 1993 manuscript now online at the CHM. If he says he wrote it in PL/M I am inclined to believe him. https://en.wikipedia.org/wiki/PL/M#CITEREFKildall1993 R's, John From douglas.mcilroy at dartmouth.edu Mon Mar 10 12:03:10 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Sun, 9 Mar 2025 22:03:10 -0400 Subject: [TUHS] What would early alternatives to C have been? Message-ID: > everyone should write for their first compiler in Pascal for a > simple language and no cheating using YACC. You need to write the whole > thing if you want to understand how parsing really works. Yacc certainly makes it easier to write parsers for big grammars, but it's far from cheating. You need to know a lot more about parsing to use Yacc than you need to roll your own. Hand parsing of a tiny grammar is almost a necessary step on the way to understanding Yacc. But I think hand-building the whole parser for a compiler is unnecessary torture--like doing trigonometry with log tables. Doug -------------- next part -------------- An HTML attachment was scrubbed... URL: From sauer at technologists.com Mon Mar 10 12:28:01 2025 From: sauer at technologists.com (Charles H. Sauer) Date: Sun, 9 Mar 2025 21:28:01 -0500 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: [While walking 5 miles through the snow …] > On Mar 9, 2025, at 9:03 PM, Douglas McIlroy wrote: > > > everyone should write for their first compiler in Pascal for a > > simple language and no cheating using YACC. You need to write the whole > > thing if you want to understand how parsing really works. > > Yacc certainly makes it easier to write parsers for big grammars, but it's far from cheating. You need to know a lot more about parsing to use Yacc than you need to roll your own. > > Hand parsing of a tiny grammar is almost a necessary step on the way to understanding Yacc. But I think hand-building the whole parser for a compiler is unnecessary torture--like doing trigonometry with log tables. > > Doug In 1971 compiler construction course at UT-Austin, we had to use CDC Fortran to build a compiler for a subset of Algol 60. Undergraduates used punch cards and RJE to the 6600/6400 shared by all but the privleged few. Charlie -- voice: +1.512.784.7526 e-mail: sauer at technologists.com fax: +1.512.346.5240 web: https://technologists.com/sauer/ Facebook/Google/LinkedIn/mas.to: CharlesHSauer -------------- next part -------------- An HTML attachment was scrubbed... URL: From luther.johnson at makerlisp.com Mon Mar 10 12:54:15 2025 From: luther.johnson at makerlisp.com (Luther Johnson) Date: Sun, 9 Mar 2025 19:54:15 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310015150.C1294BF5E87E@ary.qy> References: <20250309201345.02C6DBF5512D@ary.qy> <846b9cb5-46dc-6d88-308c-c28aed4d7911@makerlisp.com> <20250310015150.C1294BF5E87E@ary.qy> Message-ID: Got it. My misunderstanding, acknowledged. On 03/09/2025 06:51 PM, John Levine wrote: > It appears that Luther Johnson said: >> I believe CP/M was written entirely in 8080 assembly language. PL/M was >> a PL/1 subset, I think Gary Kildall was the main programmer behind that, >> and I'm sure there was a version for CP/M, but I doubt CP/M was written >> in it, ... > See this Wikipedia article on PL/M, which summarizes the history that > Kildall wrote up in a 1993 manuscript now online at the CHM. If he > says he wrote it in PL/M I am inclined to believe him. > > https://en.wikipedia.org/wiki/PL/M#CITEREFKildall1993 > > R's, > John > From lm at mcvoy.com Mon Mar 10 13:06:49 2025 From: lm at mcvoy.com (Larry McVoy) Date: Sun, 9 Mar 2025 20:06:49 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> Message-ID: <20250310030649.GR24601@mcvoy.com> On Sun, Mar 09, 2025 at 08:55:20PM -0400, Stuff Received wrote: > On 2025-03-09 19:39, Lawrence Stewart wrote: > >The C operator precedence table has 15 precedence levels, from ???++" down > >to ???," > >(see https://en.cppreference.com/w/c/language/operator_precedence > >) > > > >This is nuts. ??I don???t remember them and I wouldn???t trust an engineer > >who claimed to. > > > >Around 2005, when I was doing some chip verification, I found a hard to > >notice operator precedence bug (in VHDL, but it is the same issue) that > >would have cost us a half-million dollar mask spin. > > > >If there is more than one operator, I use parens (I do write a[x] + b[x], > >that one I know.) > > > >Our K-12 system isn???t doing us any favors when they think PEMDAS is > >???mathematics???. > > > >-L > > > >PS I???ve been a little angry about this since my 6th grader got marked > >down for using ???extra??? parentheses in class. > > Parentheses around expressions were required in our coding style. Good for you, we were the same. It's not that hard to make the code easier to read. --lm From robpike at gmail.com Mon Mar 10 14:10:15 2025 From: robpike at gmail.com (Rob Pike) Date: Mon, 10 Mar 2025 15:10:15 +1100 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: A rare case where I disagree with you, Doug. If the language is reasonably regular (I do not mean in the strict Kleene sense), a recursive descent parser is not much harder to write than a yacc grammar, and much smoother at providing good error messages. Having done many yaccs and many RD parsers, I no longer go to yacc. To put it another way, there are few programming tasks I enjoy more than writing a recursive descent parser for a sane language. Now if the language is not so regular, my position might shift. I do recall Bjarne dynamically editing the generated tables mid-parse to get yacc to handle at least one stage of C++'s development. Another way to think of it is that if you are designing the language and it is undergoing frequent changes in grammar, yacc could certainly be move you along faster. But even then once things had settled I'd still redo it as RD, for the quality of the result. You can credit Stephen R. "Software" Steve for this change in my thinking. -rob On Mon, Mar 10, 2025 at 1:12 PM Douglas McIlroy < douglas.mcilroy at dartmouth.edu> wrote: > > everyone should write for their first compiler in Pascal for a > > simple language and no cheating using YACC. You need to write the whole > > thing if you want to understand how parsing really works. > > Yacc certainly makes it easier to write parsers for big grammars, but > it's far from cheating. You need to know a lot more about parsing to use > Yacc than you need to roll your own. > > Hand parsing of a tiny grammar is almost a necessary step on the way to > understanding Yacc. But I think hand-building the whole parser for a > compiler is unnecessary torture--like doing trigonometry with log tables. > > Doug > -------------- next part -------------- An HTML attachment was scrubbed... URL: From arnold at skeeve.com Mon Mar 10 19:12:06 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Mon, 10 Mar 2025 03:12:06 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310030649.GR24601@mcvoy.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> Message-ID: <202503100912.52A9C6oT198693@freefriends.org> Larry McVoy wrote: > > Parentheses around expressions were required in our coding style. > > Good for you, we were the same. It's not that hard to make the code > easier to read. I think, as in all things, "always use parentheses" can be carried too far. I find if (a == b && c == d) perfectly reasonable, but if ((a == b) && (c == d)) to be just silly. That said, when mixing && and ||, I do parenthesize: if ((a == b && c == d) || (p == q && x == y)) If there are lots of OR conditions, I break lines: if ( (a == b && c == d) || (e == f && g == h) || (i == j && k == l) || (p == q && x == y)) I will also use parentheses with the bitwise operators. As in many things related to style, taste and judgement are needed, not just slavish adherence to instructions without any understanding of them. My two cents. Arnold (P.S. Don't get me started on `if (NULL == pointer)' and `if (42 != foo)'. Grrrr...) From lm at mcvoy.com Tue Mar 11 00:41:31 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 10 Mar 2025 07:41:31 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <202503100912.52A9C6oT198693@freefriends.org> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> Message-ID: <20250310144131.GS24601@mcvoy.com> On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > I think, as in all things, "always use parentheses" can be carried > too far. I find > > if (a == b && c == d) > > perfectly reasonable, but > > if ((a == b) && (c == d)) > > to be just silly. Color me silly then. My eyes parse the "silly" form faster. And I was the boss so my eyes won. Over time, everyone at my shop came to agree with me. One mans silly is another mans faster reading. If you worked for me, you did the silly form. And everyone came around to liking it. It's just faster to read. From clemc at ccc.com Tue Mar 11 00:52:20 2025 From: clemc at ccc.com (Clem Cole) Date: Mon, 10 Mar 2025 10:52:20 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310144131.GS24601@mcvoy.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> Message-ID: On Mon, Mar 10, 2025 at 10:41 AM Larry McVoy wrote: > On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > > I think, as in all things, "always use parentheses" can be carried > > too far. I find > > > > if (a == b && c == d) > > > > perfectly reasonable, but > > > > if ((a == b) && (c == d)) > > > > to be just silly. > > Color me silly then. Me too. > My eyes parse the "silly" form faster. Exactly. > And I was the boss so my eyes won. Over time, everyone at my shop came > to agree with me. > One mans silly is another mans faster reading. > The problem is that in the first form, you have to slow down to ensure you have the order correctly, Maybe it's my dyslexia speaking, but the "extra" parens make it clear without having to slow down. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Tue Mar 11 00:57:39 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 10 Mar 2025 10:57:39 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310144131.GS24601@mcvoy.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> Message-ID: On Mon, Mar 10, 2025 at 10:41 AM Larry McVoy wrote: > On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > > I think, as in all things, "always use parentheses" can be carried > > too far. I find > > > > if (a == b && c == d) > > > > perfectly reasonable, but > > > > if ((a == b) && (c == d)) > > > > to be just silly. > > Color me silly then. My eyes parse the "silly" form faster. And I was the > boss so my eyes won. Over time, everyone at my shop came to agree with me. > One mans silly is another mans faster reading. > > If you worked for me, you did the silly form. And everyone came around to > liking it. It's just faster to read. I'm with Arnold on this one in terms of preference, but I think there's a meta point that's interesting to consider: style localizations facilitate understanding and knowledge transfer across large code bases. Google has a codebase measured in the BLOC (B as in billion), and very strict style guides; they're really more hard and fast rules as opposed to a soft thing that the term "guide" tends to imply. Personally, I hated Google's style formats; but after a month or two they largely disappeared for me and I stopped thinking about them. I still thought and continue to think they're ugly, but the network effects of being able to `cd` anywhere in that massive monorepo and be able to read a file without the cognitive overhead of internalizing that this programmer preferred to put the braces somewhere else relative to the last programmer was a huge win, far outweighing my aesthetic objections. This directly contributed to allowing the organization to scale to tens of thousands of programmers working in the same code base simultaneously. Supported by tools like automatic formatters, folks concentrated on the problem at hand and not the minutiae of formatting. Uniform style also facilitated things like large-scale, automated refactorings; another huge win in evolving code to newer standards, fixing old problems, and so forth. So even though I didn't like the specifics of the local standard, I respected what it did for us. So I don't think the specifics of whether you put parentheses around every boolean expression matter as much as whether you aggressively commit to consistency everywhere and support that with tooling and automation. - Dan C. From lm at mcvoy.com Tue Mar 11 01:06:11 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 10 Mar 2025 08:06:11 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> Message-ID: <20250310150611.GV24601@mcvoy.com> On Mon, Mar 10, 2025 at 10:52:20AM -0400, Clem Cole wrote: > On Mon, Mar 10, 2025 at 10:41???AM Larry McVoy wrote: > > > On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > > > I think, as in all things, "always use parentheses" can be carried > > > too far. I find > > > > > > if (a == b && c == d) > > > > > > perfectly reasonable, but > > > > > > if ((a == b) && (c == d)) > > > > > > to be just silly. > > > > Color me silly then. > > Me too. > > > My eyes parse the "silly" form faster. > > Exactly. > > > And I was the boss so my eyes won. Over time, everyone at my shop came > > to agree with me. > > One mans silly is another mans faster reading. > > > The problem is that in the first form, you have to slow down to ensure you > have the order correctly, Maybe it's my dyslexia speaking, but the "extra" > parens make it clear without having to slow down. Exactly. It's not that Clem and I are stupid, we aren't. It's that we want to use our brains to work on stuff that is actually hard. Making us spend effort on "silly" stuff is just rude, just don't do that. You are entitled to your opinion for sure, but you are less entitled when that opinion creates pointless work for other people. From lm at mcvoy.com Tue Mar 11 01:09:25 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 10 Mar 2025 08:09:25 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> Message-ID: <20250310150925.GW24601@mcvoy.com> On Mon, Mar 10, 2025 at 10:57:39AM -0400, Dan Cross wrote: > So I don't think the specifics of whether you put parentheses around > every boolean expression matter as much as whether you aggressively > commit to consistency everywhere and support that with tooling and > automation. Amen. I've contributed to GNU C code, and while I _HATE_ their coding style, my contributions were in that style. It's their house, their rules, and Dan is spot on as to why that is important. From clemc at ccc.com Tue Mar 11 01:12:37 2025 From: clemc at ccc.com (Clem Cole) Date: Mon, 10 Mar 2025 11:12:37 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: below... (At the risk of a case of being in disagreement with someone I rarely am). On Sun, Mar 9, 2025 at 10:03 PM Douglas McIlroy < douglas.mcilroy at dartmouth.edu> wrote: > > Yacc certainly makes it easier to write parsers for big grammars, but > it's far from cheating. You need to know a lot more about parsing to use > Yacc than you need to roll your own. > I disagree here. I have wtchged too many young programmers that I realized did not understand what yacc was doing and came to understanding that never built a simple parse before. I'm not asking them to rebuild a parser for Algol68 or C++ in a one semester course. I asking that to you take something like Dennis Allisons "tiny basic" assignment for his Stanfrodf students in the lat 1970s and ask them to build that. Peter Hibbard gave us a super small Algol-ish language, and we generated code for the "CMU Mini" (which was basically Gordon Bell's the precursor to the PDP-8 (see ISP below) - in my time the "Dragon Book was not yet written BTW. I really understood what Stev had accomplished. The interesting thing is while I tend to use yacc for most parsers, I've found myself building recursive descent parsers in a couple of cases where it seems to make better sense. > > Hand parsing of a tiny grammar is almost a necessary step on the way to > understanding Yacc. But I think hand-building the whole parser for a > compiler is unnecessary torture--like doing trigonometry with log tables. > Point table ... however a funny thing to consider. My father was a high school math teacher for 40 years. He noted that the loss of slide rules and trig tables made his "modern" (post-calculator) students not understand interpolation.To me, learning to use mathematical tables is a skill that never hurts. I don't even think of asking people "to walk uphill both ways to school in the snow and rain instead of riding a heated, dry yellow school bus." I am asking the student to learn foundational ideas and build skills up top of core understanding. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From cowan at ccil.org Tue Mar 11 01:19:19 2025 From: cowan at ccil.org (John Cowan) Date: Mon, 10 Mar 2025 11:19:19 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: I was working at the whiteboard during a job interview once. I had been asked to write a function to report if its input had balanced parentheses. No problem: I wrote an RD parser in Python (which I prefer for whiteboarding) to detect balance and return True if the parse was successful and False if EOF was reached. I was starting to write some tests when the interviewer interrupted me. "What is that?" "It's a recursive descent parser. It detects if the input is well-formed." Blank look. I started to walk him through the code. He interrupted me. "Excuse me, I'll be back in a few minutes." Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the recruiter will get back to you." That's the last I hear from them. On Mon, Mar 10, 2025, 12:10 AM Rob Pike wrote: > A rare case where I disagree with you, Doug. If the language is reasonably > regular (I do not mean in the strict Kleene sense), a recursive descent > parser is not much harder to write than a yacc grammar, and much smoother > at providing good error messages. Having done many yaccs and many RD > parsers, I no longer go to yacc. > > To put it another way, there are few programming tasks I enjoy more than > writing a recursive descent parser for a sane language. > > Now if the language is not so regular, my position might shift. I do > recall Bjarne dynamically editing the generated tables mid-parse to get > yacc to handle at least one stage of C++'s development. > > Another way to think of it is that if you are designing the language and > it is undergoing frequent changes in grammar, yacc could certainly be move > you along faster. But even then once things had settled I'd still redo it > as RD, for the quality of the result. > > You can credit Stephen R. "Software" Steve for this change in my thinking. > > -rob > > > On Mon, Mar 10, 2025 at 1:12 PM Douglas McIlroy < > douglas.mcilroy at dartmouth.edu> wrote: > >> > everyone should write for their first compiler in Pascal for a >> > simple language and no cheating using YACC. You need to write the whole >> > thing if you want to understand how parsing really works. >> >> Yacc certainly makes it easier to write parsers for big grammars, but >> it's far from cheating. You need to know a lot more about parsing to use >> Yacc than you need to roll your own. >> >> Hand parsing of a tiny grammar is almost a necessary step on the way to >> understanding Yacc. But I think hand-building the whole parser for a >> compiler is unnecessary torture--like doing trigonometry with log tables. >> >> Doug >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Tue Mar 11 01:24:32 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 10 Mar 2025 11:24:32 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: On Mon, Mar 10, 2025 at 11:13 AM Clem Cole wrote: > On Sun, Mar 9, 2025 at 10:03 PM Douglas McIlroy wrote: >> Yacc certainly makes it easier to write parsers for big grammars, but it's far from cheating. You need to know a lot more about parsing to use Yacc than you need to roll your own. > > I disagree here. I have wtchged too many young programmers that I realized did not understand what yacc was doing and came to understanding that never built a simple parse before. Funny, I agree with both of you, and with Rob. :-) If I'm not mistaken, I think what Doug was suggesting was that, to really understand what YACC is _doing_ requires understanding a lot more about the theory of parsing, grammars, and so on, than one needs to understand to write a fairly simple RD parser. RD makes a lot of sense intuitively, but once you introduce table-driven parsing, LALR(1), etc, you get into a whole different level of theory and rigor. Of course, by design the tool abstracts that away from you, so to just _use_ it you don't necessarily need to understand as much. >[...] > The interesting thing is while I tend to use yacc for most parsers, I've found myself building recursive descent parsers in a couple of cases where it seems to make better sense. Funny, I'm the opposite: I've used YACC for a few parsers, but prefer to roll my own. I find that I'm either parsing something very simple, in which case reaching for YACC feels like cutting butter with a chainsaw, or I need to do something that YACC isn't super great at (like run in a multithreaded environment). YACC is superb at what it does, but not something I feel like I need that often. - Dan C. From crossd at gmail.com Tue Mar 11 01:27:46 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 10 Mar 2025 11:27:46 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310150611.GV24601@mcvoy.com> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150611.GV24601@mcvoy.com> Message-ID: On Mon, Mar 10, 2025 at 11:12 AM Larry McVoy wrote: > On Mon, Mar 10, 2025 at 10:52:20AM -0400, Clem Cole wrote: > > On Mon, Mar 10, 2025 at 10:41???AM Larry McVoy wrote: > > > On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > > > > I think, as in all things, "always use parentheses" can be carried > > > > too far. I find > > > > > > > > if (a == b && c == d) > > > > > > > > perfectly reasonable, but > > > > > > > > if ((a == b) && (c == d)) > > > > > > > > to be just silly. > > > > > > Color me silly then. > > > > Me too. > > > > > My eyes parse the "silly" form faster. > > > > Exactly. > > > > > And I was the boss so my eyes won. Over time, everyone at my shop came > > > to agree with me. > > > One mans silly is another mans faster reading. > > > > > The problem is that in the first form, you have to slow down to ensure you > > have the order correctly, Maybe it's my dyslexia speaking, but the "extra" > > parens make it clear without having to slow down. > > Exactly. It's not that Clem and I are stupid, we aren't. Oh goodness; I hope we can all agree that no one was suggesting that! > [snip] creates pointless work for other people. Along these lines, I can't stand when people write, `if (!strcmp(a, b)) { /* strings are equal */ }` While true, I find the negation particularly annoying in this context, and prefer either `if (strcmp(a, b) == 0) { ... }` or writing a small `streq()` function that returns boolean true/false. - Dan C. From lm at mcvoy.com Tue Mar 11 01:46:56 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 10 Mar 2025 08:46:56 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150611.GV24601@mcvoy.com> Message-ID: <20250310154656.GY24601@mcvoy.com> On Mon, Mar 10, 2025 at 11:27:46AM -0400, Dan Cross wrote: > > [snip] creates pointless work for other people. > > Along these lines, I can't stand when people write, `if (!strcmp(a, > b)) { /* strings are equal */ }` > > While true, I find the negation particularly annoying in this context, > and prefer either `if (strcmp(a, b) == 0) { ... }` or writing a small > `streq()` function that returns boolean true/false. >From bitkeeper's style.h: #define streq(a,b) (!strcmp((a),(b))) and #define unless(e) if (!(e)) Maybe I should post the whole file :-) From imp at bsdimp.com Tue Mar 11 01:47:01 2025 From: imp at bsdimp.com (Warner Losh) Date: Mon, 10 Mar 2025 09:47:01 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150611.GV24601@mcvoy.com> Message-ID: On Mon, Mar 10, 2025 at 9:28 AM Dan Cross wrote: > On Mon, Mar 10, 2025 at 11:12 AM Larry McVoy wrote: > > On Mon, Mar 10, 2025 at 10:52:20AM -0400, Clem Cole wrote: > > > On Mon, Mar 10, 2025 at 10:41???AM Larry McVoy wrote: > > > > On Mon, Mar 10, 2025 at 03:12:06AM -0600, arnold at skeeve.com wrote: > > > > > I think, as in all things, "always use parentheses" can be carried > > > > > too far. I find > > > > > > > > > > if (a == b && c == d) > > > > > > > > > > perfectly reasonable, but > > > > > > > > > > if ((a == b) && (c == d)) > > > > > > > > > > to be just silly. > > > > > > > > Color me silly then. > > > > > > Me too. > > > > > > > My eyes parse the "silly" form faster. > > > > > > Exactly. > > > > > > > And I was the boss so my eyes won. Over time, everyone at my shop > came > > > > to agree with me. > > > > One mans silly is another mans faster reading. > > > > > > > The problem is that in the first form, you have to slow down to ensure > you > > > have the order correctly, Maybe it's my dyslexia speaking, but the > "extra" > > > parens make it clear without having to slow down. > > > > Exactly. It's not that Clem and I are stupid, we aren't. > > Oh goodness; I hope we can all agree that no one was suggesting that! > Yea. That's the problem with style debates: It's all about taste, preference and workflow. Those items differ wildly, are core to one's identity (to a degree not usually appreciated) and so it turns personal w/o people intending it to do so. I saw a presentation years ago that went over all the human factors and showed that there's too many 80/20 60/40 divides in how people's brains work to have a universal style that would be friction free for everybody. I searched for it just now, but couldn't find it. I'm team "If you want that many parens, use LISP" not team "Explicit parens make it clearer" myself, but as you see, my internal framing isn't exactly respectul while trying to be funny... Another hazard of style debates. > > [snip] creates pointless work for other people. > > Along these lines, I can't stand when people write, `if (!strcmp(a, > b)) { /* strings are equal */ }` > > While true, I find the negation particularly annoying in this context, > and prefer either `if (strcmp(a, b) == 0) { ... }` or writing a small > `streq()` function that returns boolean true/false. > There's a number of things in C that we'd do differently in other languages... FreeBSD encourages the latter style over the former and is an odd choice of concise and verbose when it comes to notation (sometimes delete the extra parens, sometimes have extra == 0 or != 0). Warner -------------- next part -------------- An HTML attachment was scrubbed... URL: From arnold at skeeve.com Tue Mar 11 02:30:19 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Mon, 10 Mar 2025 10:30:19 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250310150925.GW24601@mcvoy.com> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150925.GW24601@mcvoy.com> Message-ID: <202503101630.52AGUJYJ226581@freefriends.org> Larry McVoy wrote: > On Mon, Mar 10, 2025 at 10:57:39AM -0400, Dan Cross wrote: > > So I don't think the specifics of whether you put parentheses around > > every boolean expression matter as much as whether you aggressively > > commit to consistency everywhere and support that with tooling and > > automation. > > Amen. I've contributed to GNU C code, and while I _HATE_ their coding > style, my contributions were in that style. It's their house, their > rules, and Dan is spot on as to why that is important. I agree 110% with Dan as well. And with Larry about GNU code. Long ago I worked in a startup, and early on the developers got together and wrote up a coding style to use. A year later a new guy came on board, who refused to follow it. It was then easy to see when what's-his-name had touched a part of the code. (I forgot what his name was.) Quite annoying. Arnold From tuhs at tuhs.org Tue Mar 11 04:18:45 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Mon, 10 Mar 2025 18:18:45 +0000 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <202503101630.52AGUJYJ226581@freefriends.org> References: <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150925.GW24601@mcvoy.com> <202503101630.52AGUJYJ226581@freefriends.org> Message-ID: <1BihmLbJy9QZ-YoC9Kzo67-SAJiWxAJXKoFfoGaPie8bgOGdUlT8YZv7MZiP6PvB8J-wsAXLcrBpjV2l93nyLH_EPTfkWKFayI1idGPUdxw=@protonmail.com> Sent with Proton Mail secure email. On Monday, March 10th, 2025 at 9:30 AM, arnold at skeeve.com wrote: > Larry McVoy lm at mcvoy.com wrote: > > > On Mon, Mar 10, 2025 at 10:57:39AM -0400, Dan Cross wrote: > > > > > So I don't think the specifics of whether you put parentheses around > > > every boolean expression matter as much as whether you aggressively > > > commit to consistency everywhere and support that with tooling and > > > automation. > > > > Amen. I've contributed to GNU C code, and while I HATE their coding > > style, my contributions were in that style. It's their house, their > > rules, and Dan is spot on as to why that is important. > > > I agree 110% with Dan as well. And with Larry about GNU code. > > Long ago I worked in a startup, and early on the developers got > together and wrote up a coding style to use. A year later a new > guy came on board, who refused to follow it. It was then > easy to see when what's-his-name had touched a part of the code. (I forgot > what his name was.) Quite annoying. > > Arnold I'm quite enjoying following along in this thread as my current work team is in a growth pattern. For the first year of the project it was mainly just myself and my manager slinging code. We had been bumped over from the same project so already had a pretty good synergy. Then we brought in a few new folks and managed to settle into a good groove, we weren't using automated style enforcement but didn't tend to step on each others toes. This year the team has expanded both a few times over in numbers as well as in geographic footprint and the divergences in approach are finally becoming "a thing" that needs to be addressed intentionally rather than organically. It's proving quite a pain, I used to be a stickler for style and have in the past several years tried to relax that attitude...which now with the growing need to pay attention to that again, it's a bit jarring to now see the pendulum swing in the opposite direction. I generally like deferring to folks tasked with a problem to come up with all the minutiae surrounding their solution, but with the size our team is growing to, I'm coming to recognize where having those structural guardrails saves everyone headaches in the long run. Generally I'm a more parens is fine if it makes meaning clear kind of person, annoyingly though with some assembler projects. Since parentheses are more context-sensitive in different assembler languages, I've found plenty of situations where I use parentheses in a macro or equate definition only to find later when they're used with some operation, they trigger an addressing mode that I didn't intend. - Matt G. From stuff at riddermarkfarm.ca Tue Mar 11 04:39:32 2025 From: stuff at riddermarkfarm.ca (Stuff Received) Date: Mon, 10 Mar 2025 14:39:32 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <202503101630.52AGUJYJ226581@freefriends.org> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150925.GW24601@mcvoy.com> <202503101630.52AGUJYJ226581@freefriends.org> Message-ID: <8e82abeb-1b12-b0a1-91d7-e4b6271debdc@riddermarkfarm.ca> On 2025-03-10 12:30, arnold at skeeve.com wrote: > Larry McVoy wrote: > >> On Mon, Mar 10, 2025 at 10:57:39AM -0400, Dan Cross wrote: >>> So I don't think the specifics of whether you put parentheses around >>> every boolean expression matter as much as whether you aggressively >>> commit to consistency everywhere and support that with tooling and >>> automation. >> >> Amen. I've contributed to GNU C code, and while I _HATE_ their coding >> style, my contributions were in that style. It's their house, their >> rules, and Dan is spot on as to why that is important. > > I agree 110% with Dan as well. And with Larry about GNU code. > > Long ago I worked in a startup, and early on the developers got > together and wrote up a coding style to use. A year later a new > guy came on board, who refused to follow it. It was then > easy to see when what's-his-name had touched a part of the code. (I forgot > what his name was.) Quite annoying. > > Arnold We foresaw that and ensured that different code could be reformatted with indent. S. From tuhs at tuhs.org Tue Mar 11 04:56:46 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Mon, 10 Mar 2025 11:56:46 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <202503101630.52AGUJYJ226581@freefriends.org> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309231222.GN24601@mcvoy.com> <1ED6C816-5CFC-447A-AF8A-EB1166AF881B@serissa.com> <20250310030649.GR24601@mcvoy.com> <202503100912.52A9C6oT198693@freefriends.org> <20250310144131.GS24601@mcvoy.com> <20250310150925.GW24601@mcvoy.com> <202503101630.52AGUJYJ226581@freefriends.org> Message-ID: <1D589632-975D-4F3D-A1E9-F16D22F176E4@iitbombay.org> On Mar 10, 2025, at 9:30 AM, arnold at skeeve.com wrote: > > Long ago I worked in a startup, and early on the developers got > together and wrote up a coding style to use. A year later a new > guy came on board, who refused to follow it. It was then > easy to see when what's-his-name had touched a part of the code. (I forgot > what his name was.) Quite annoying. Most every company where I worked did this (agreed upon a coding style). One of my best engineers, who usually said very little, preferred a very different style but he set up scripts so that in the editor he saw *his* favorite style and before checking in a file he ran indent to conform to the agreed upon style. From tuhs at tuhs.org Tue Mar 11 06:49:29 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Mon, 10 Mar 2025 13:49:29 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: Perhaps the interviewer was looking for something dumb like the following and not a full RD parser? int count = 0; while (*cp) { char c = *cp++; count += c == '(' ? 1 : c == ')' ? -1 : 0; if (count < 0) return -1; // FAIL: one too many ) } if (count > 0) return -1; // FAIL: too many ( return 0; // SUCCESS Though this will fall apart if you also want to also balance braces &/or brackets and must catch invalid cases like "(..[..)..]"! > On Mar 10, 2025, at 8:19 AM, John Cowan wrote: > > I was working at the whiteboard during a job interview once. I had been asked to write a function to report if its input had balanced parentheses. No problem: I wrote an RD parser in Python (which I prefer for whiteboarding) to detect balance and return True if the parse was successful and False if EOF was reached. > > I was starting to write some tests when the interviewer interrupted me. > > "What is that?" > > "It's a recursive descent parser. It detects if the input is well-formed." > > Blank look. > > I started to walk him through the code. > > He interrupted me. "Excuse me, I'll be back in a few minutes." > > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the recruiter will get back to you." That's the last I hear from them. From dave at horsfall.org Tue Mar 11 05:56:50 2025 From: dave at horsfall.org (Dave Horsfall) Date: Tue, 11 Mar 2025 06:56:50 +1100 (AEDT) Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: On Mon, 10 Mar 2025, John Cowan wrote: > I was working at the whiteboard during a job interview once. I had been > asked to write a function to report if its input had balanced parentheses.  > No problem: I wrote an RD parser in Python (which I prefer for > whiteboarding) to detect balance and return True if the parse was successful > and False if EOF was reached. RD is a bit over the top, isn't it? Pseudocode: set counter to 0 while !EOF do read char char == "(" -> counter++ char == ")" -> counter-- abort "Unbalanced: )" if counter < 0 done abort "Unbalanced: (" if counter > 0 Untested, of course :-) -- Dave From audioskeptic at gmail.com Tue Mar 11 08:55:49 2025 From: audioskeptic at gmail.com (James Johnston) Date: Mon, 10 Mar 2025 15:55:49 -0700 Subject: [TUHS] "Webster's Second on the Head of a Pin"? In-Reply-To: References: Message-ID: Doug, do you know how much better Ziv-Lempel does? I'm guessing "a little bit"? On Mon, Mar 10, 2025 at 3:52 PM Douglas McIlroy < douglas.mcilroy at dartmouth.edu> wrote: > I have sent a copy to Warren for the Unix archives. > > Doug > > On Sat, Dec 28, 2024 at 9:54 PM Royce Williams > wrote: > > > > Someone I know is seeking the original version of an internal Bell Labs > memo from 1974 titled "Webster's Second on the Head of a Pin" by Morris and > Thompson. The topic appears to be related to improving the speed of lookups > or search. It's cited in a few papers as "Unpublished Technical Memo, Bell > Laboratories, Murray Hill, NJ 1974." All I can find online is citations. > Any leads appreciated! > > > > -- > > Royce > -- James D. (jj) Johnston Former Chief Scientist, Immersion Networks -------------- next part -------------- An HTML attachment was scrubbed... URL: From mrochkind at gmail.com Tue Mar 11 09:12:20 2025 From: mrochkind at gmail.com (Marc Rochkind) Date: Mon, 10 Mar 2025 17:12:20 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: This thread started to be about what I thought were system programming languages (e.g., C, BLISS) and seems to have meandered into a general discussion of languages that were around in the 1960s and 1970s, so, what the heck, I'll add my own story. PL/0 is an education programming language introduced in the book, *Algorithms + Data Structures = Programs*, by Niklaus Wirth in 1976. It's a great language for teaching compiler writing because it contains interesting concepts, such as recursive functions, yet isn't overly complicated. I wrote a PL/0 compiler for the IBM 701 ( https://github.com/MarcRochkind/pl0compiler). Yeah, that's not a misprint. I wrote perhaps the world's only 701 emulator ( https://www.mrochkind.com/mrochkind/a-701.html), and my PL/0 compiler runs on it. Unfortunately, I can't verify that the compiled code runs on an actual 701, since I'm sure there haven't been any in operation for many decades. For those of you who haven't had the pleasure, programming the 701 is really hard. It had no index registers, and the sign bit didn't participate in shifts. Still, my compiler compiles full-blown PL/0. So there! ;-) Marc Rochkind On Mon, Mar 10, 2025 at 2:49 PM Bakul Shah via TUHS wrote: > Perhaps the interviewer was looking for something dumb like the following > and not a full RD parser? > > int count = 0; > while (*cp) { > char c = *cp++; > count += c == '(' ? 1 : c == ')' ? -1 : 0; > if (count < 0) return -1; // FAIL: one too many ) > } > if (count > 0) return -1; // FAIL: too many ( > return 0; // SUCCESS > > Though this will fall apart if you also want to also balance braces &/or > brackets and must catch invalid cases like "(..[..)..]"! > > > On Mar 10, 2025, at 8:19 AM, John Cowan wrote: > > > > I was working at the whiteboard during a job interview once. I had been > asked to write a function to report if its input had balanced parentheses. > No problem: I wrote an RD parser in Python (which I prefer for > whiteboarding) to detect balance and return True if the parse was > successful and False if EOF was reached. > > > > I was starting to write some tests when the interviewer interrupted me. > > > > "What is that?" > > > > "It's a recursive descent parser. It detects if the input is > well-formed." > > > > Blank look. > > > > I started to walk him through the code. > > > > He interrupted me. "Excuse me, I'll be back in a few minutes." > > > > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the > recruiter will get back to you." That's the last I hear from them. > > -- Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. -------------- next part -------------- An HTML attachment was scrubbed... URL: From woods at robohack.ca Tue Mar 11 09:25:37 2025 From: woods at robohack.ca (Greg A. Woods) Date: Mon, 10 Mar 2025 16:25:37 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: At Sun, 9 Mar 2025 18:58:14 -0400, Clem Cole wrote: Subject: [TUHS] Re: What would early alternatives to C have been? > > I find where the C standard folks falling into the C++ trap and letting > people pee it. CC++ is a disaster ― primarily because everyone added their > favorite feature. That's while I like Go for applications work, and if I > were to do OS (near the HW development) today and not use C, I would have > to consider Rust. Perhaps Rust gets/deserves its name from the corrosion caused by everyone peeing into it too. For me it's way too much of a kitchen sink of what seem to me to be half-baked ideas. W.r.t. Unix history, and any alternatives to C, well I did mention TUNIS a bit ago, and the fact it was implemented in Concurrent Euclid. That's quite a bit later than the first C though. I would guess though that even its parent, Euclid, which came along a little earlier, would still have been too late to have been useful for early Unix. It's mostly just another Algol derivative anyway. It did have some big funding from the military in the early days, but look where that got it. I think one of the most loved innovations C (and B just before it) brought to programming languages was the use of braces instead of begin/end, along with the elimination of some of the other extra unnecessary keywords like "then". These syntactical changes made programming a lot more fun for many of us. I for one dreaded having to use Pascal, or heaven forbid PL/1, after I got used to C just because of the wordy syntax alone (I was still too inexperienced at the time to appreciate BWK's criticisms of Pascal). I had the "fun" experience of learning BASIC and a bit of Pascal in high school, then being thrown into Pascal, C, PL/1, and lisp, all as learning languages, in first year (the first two on Unix, the second two on Multics). Which reminds me, wasn't there was some mention of difficulties with Pascal et al w.r.t. separate compilation earlier in this thread.... Wasn't that one of the features of Berkeley Pascal? I know separate compilation is a definitely a feature by the time it makes it to SunOS/Solaris-2. I remember trying to figure it out once for a class assignment and the TA scowled at me and asked me why on earth I would ever want to do that, so I never trusted that TA on programming languages, or much of anything else, ever again. Proper and elegant and simple modules for a systems programming language would seem a pre-requisite these days though. Preprocessors and headers be gone! -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From tuhs at tuhs.org Tue Mar 11 09:35:53 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Mon, 10 Mar 2025 23:35:53 +0000 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: <_eoCy4RgQnOHXyXj0k3-VEDDzQ2HTk7DiUCA8NAFq1ozXQtTbadAXjku2d4D0DwSOeTADMrawQPwSzze6u6b94cwkCfFs7uqFCsrq2c9aqI=@protonmail.com> On Monday, March 10th, 2025 at 4:25 PM, Greg A. Woods wrote: > > ... > > Which reminds me, wasn't there was some mention of difficulties with > Pascal et al w.r.t. separate compilation earlier in this thread.... > Wasn't that one of the features of Berkeley Pascal? I know separate > compilation is a definitely a feature by the time it makes it to > SunOS/Solaris-2. I remember trying to figure it out once for a class > assignment and the TA scowled at me and asked me why on earth I would > ever want to do that, so I never trusted that TA on programming > languages, or much of anything else, ever again. > > ... > > -- > Greg A. Woods gwoods at acm.org > > > Kelowna, BC +1 250 762-7675 RoboHack woods at robohack.ca > > Planix, Inc. woods at planix.com Avoncote Farms woods at avoncote.ca If by separate compilation you mean the compiler spits out (possibly optionally) human-readable assembler, I would assume that is nearly crucial for any systems language, otherwise you're putting a lot of faith in an ABI when it comes to linking with your machdep assembly files no? I'd be nervous to use something for bare metal programming that I couldn't easily inspect the resulting assembler code from... - Matt G. From clemc at ccc.com Tue Mar 11 09:49:15 2025 From: clemc at ccc.com (Clem Cole) Date: Mon, 10 Mar 2025 19:49:15 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: Marc - check out OpenSIMH( https://opensimh.org) Check out over 40 different simulators including the I7000 which supports IBM 701,7010,7070,7080, 7090 - https://opensimh.org/simulators/ ᐧ On Mon, Mar 10, 2025 at 7:12 PM Marc Rochkind wrote: > This thread started to be about what I thought were system programming > languages (e.g., C, BLISS) and seems to have meandered into a general > discussion of languages that were around in the 1960s and 1970s, so, what > the heck, I'll add my own story. > > PL/0 is an education programming language introduced in the book, *Algorithms > + Data Structures = Programs*, by Niklaus Wirth in 1976. It's a great > language for teaching compiler writing because it contains interesting > concepts, such as recursive functions, yet isn't overly complicated. I > wrote a PL/0 compiler for the IBM 701 ( > https://github.com/MarcRochkind/pl0compiler). > > Yeah, that's not a misprint. I wrote perhaps the world's only 701 emulator > (https://www.mrochkind.com/mrochkind/a-701.html), and my PL/0 compiler > runs on it. Unfortunately, I can't verify that the compiled code runs on an > actual 701, since I'm sure there haven't been any in operation for many > decades. For those of you who haven't had the pleasure, programming the 701 > is really hard. It had no index registers, and the sign bit didn't > participate in shifts. Still, my compiler compiles full-blown PL/0. > > So there! ;-) > > Marc Rochkind > > On Mon, Mar 10, 2025 at 2:49 PM Bakul Shah via TUHS wrote: > >> Perhaps the interviewer was looking for something dumb like the following >> and not a full RD parser? >> >> int count = 0; >> while (*cp) { >> char c = *cp++; >> count += c == '(' ? 1 : c == ')' ? -1 : 0; >> if (count < 0) return -1; // FAIL: one too many ) >> } >> if (count > 0) return -1; // FAIL: too many ( >> return 0; // SUCCESS >> >> Though this will fall apart if you also want to also balance braces &/or >> brackets and must catch invalid cases like "(..[..)..]"! >> >> > On Mar 10, 2025, at 8:19 AM, John Cowan wrote: >> > >> > I was working at the whiteboard during a job interview once. I had been >> asked to write a function to report if its input had balanced parentheses. >> No problem: I wrote an RD parser in Python (which I prefer for >> whiteboarding) to detect balance and return True if the parse was >> successful and False if EOF was reached. >> > >> > I was starting to write some tests when the interviewer interrupted me. >> > >> > "What is that?" >> > >> > "It's a recursive descent parser. It detects if the input is >> well-formed." >> > >> > Blank look. >> > >> > I started to walk him through the code. >> > >> > He interrupted me. "Excuse me, I'll be back in a few minutes." >> > >> > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the >> recruiter will get back to you." That's the last I hear from them. >> >> > > -- > Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mrochkind at gmail.com Tue Mar 11 09:58:43 2025 From: mrochkind at gmail.com (Marc Rochkind) Date: Mon, 10 Mar 2025 17:58:43 -0600 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: Clem, thanks... didn't know about those. MIne has two features that the other one perhaps doesn't: A fully functional control panel, and it operates at the speed of the 701. On Mon, Mar 10, 2025 at 5:49 PM Clem Cole wrote: > Marc - check out OpenSIMH( https://opensimh.org) > Check out over 40 different simulators including the I7000 which > supports IBM 701,7010,7070,7080, 7090 - https://opensimh.org/simulators/ > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Tue Mar 11 10:01:37 2025 From: clemc at ccc.com (Clem Cole) Date: Mon, 10 Mar 2025 20:01:37 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> Message-ID: On Mon, Mar 10, 2025 at 7:25 PM Greg A. Woods wrote: > > > Which reminds me, wasn't there was some mention of difficulties with > Pascal et al w.r.t. separate compilation earlier in this thread.... > Wasn't that one of the features of Berkeley Pascal? Yes - had extensions for that so: compiling to *.o and using external libraries was possible. man pc will describe the -c switch pc was the hooks into the pcc system for the vax, but I'm not sure if it was backported the PDP-11. It was moved to the 68k by a couple different teams, including Sun for SunOS. Maybe Larry or Rob G can answer if they ever moved it to SPARC. By that time, they had there own compiler suite and may have started to abandon anything based on pcc. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenbob at gmail.com Tue Mar 11 10:06:13 2025 From: kenbob at gmail.com (Ken Thompson) Date: Mon, 10 Mar 2025 17:06:13 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: re yacc vs RD i agree that they are about the same, where the edge would tilt based on the parsed language. BUT when the parsed language (like go) is not yet defined, yacc is the only option. On Mon, Mar 10, 2025 at 4:50 PM Clem Cole wrote: > Marc - check out OpenSIMH( https://opensimh.org) > Check out over 40 different simulators including the I7000 which > supports IBM 701,7010,7070,7080, 7090 - https://opensimh.org/simulators/ > > > ᐧ > > On Mon, Mar 10, 2025 at 7:12 PM Marc Rochkind wrote: > >> This thread started to be about what I thought were system programming >> languages (e.g., C, BLISS) and seems to have meandered into a general >> discussion of languages that were around in the 1960s and 1970s, so, what >> the heck, I'll add my own story. >> >> PL/0 is an education programming language introduced in the book, *Algorithms >> + Data Structures = Programs*, by Niklaus Wirth in 1976. It's a great >> language for teaching compiler writing because it contains interesting >> concepts, such as recursive functions, yet isn't overly complicated. I >> wrote a PL/0 compiler for the IBM 701 ( >> https://github.com/MarcRochkind/pl0compiler). >> >> Yeah, that's not a misprint. I wrote perhaps the world's only 701 >> emulator (https://www.mrochkind.com/mrochkind/a-701.html), and my PL/0 >> compiler runs on it. Unfortunately, I can't verify that the compiled code >> runs on an actual 701, since I'm sure there haven't been any in operation >> for many decades. For those of you who haven't had the pleasure, >> programming the 701 is really hard. It had no index registers, and the sign >> bit didn't participate in shifts. Still, my compiler compiles full-blown >> PL/0. >> >> So there! ;-) >> >> Marc Rochkind >> >> On Mon, Mar 10, 2025 at 2:49 PM Bakul Shah via TUHS >> wrote: >> >>> Perhaps the interviewer was looking for something dumb like the following >>> and not a full RD parser? >>> >>> int count = 0; >>> while (*cp) { >>> char c = *cp++; >>> count += c == '(' ? 1 : c == ')' ? -1 : 0; >>> if (count < 0) return -1; // FAIL: one too many ) >>> } >>> if (count > 0) return -1; // FAIL: too many ( >>> return 0; // SUCCESS >>> >>> Though this will fall apart if you also want to also balance braces &/or >>> brackets and must catch invalid cases like "(..[..)..]"! >>> >>> > On Mar 10, 2025, at 8:19 AM, John Cowan wrote: >>> > >>> > I was working at the whiteboard during a job interview once. I had >>> been asked to write a function to report if its input had balanced >>> parentheses. No problem: I wrote an RD parser in Python (which I prefer >>> for whiteboarding) to detect balance and return True if the parse was >>> successful and False if EOF was reached. >>> > >>> > I was starting to write some tests when the interviewer interrupted me. >>> > >>> > "What is that?" >>> > >>> > "It's a recursive descent parser. It detects if the input is >>> well-formed." >>> > >>> > Blank look. >>> > >>> > I started to walk him through the code. >>> > >>> > He interrupted me. "Excuse me, I'll be back in a few minutes." >>> > >>> > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the >>> recruiter will get back to you." That's the last I hear from them. >>> >>> >> >> -- >> Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglas.mcilroy at dartmouth.edu Tue Mar 11 10:23:30 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Mon, 10 Mar 2025 20:23:30 -0400 Subject: [TUHS] "Webster's Second on the Head of a Pin"? In-Reply-To: References: Message-ID: Gnu lzip -9 got a 3.92 compression factor , while Morris-Thompson got 4.52. According to the lzip man page, lzip has integrity checks, which must take some space. It is also said to be a "simplified" L-Z, though the only simplification that is mentioned--bounding the size of the dictionary--can only happen on much bigger files than Webster. If you still wonder how well "pure" L-Z does, I'll be happy to try your favorite implementation. It did not surprise me that lzip lost to M-T, for it has to discover over and over again that the dictionary is lexicographically ordered--a fact that gives M-T most of its leverage. The dictionary is definitely not generated by a Markov process! Doug On Mon, Mar 10, 2025 at 6:56 PM James Johnston wrote: > > Doug, do you know how much better Ziv-Lempel does? I'm guessing "a little bit"? > > On Mon, Mar 10, 2025 at 3:52 PM Douglas McIlroy wrote: >> >> I have sent a copy to Warren for the Unix archives. >> >> Doug >> >> On Sat, Dec 28, 2024 at 9:54 PM Royce Williams wrote: >> > >> > Someone I know is seeking the original version of an internal Bell Labs memo from 1974 titled "Webster's Second on the Head of a Pin" by Morris and Thompson. The topic appears to be related to improving the speed of lookups or search. It's cited in a few papers as "Unpublished Technical Memo, Bell Laboratories, Murray Hill, NJ 1974." All I can find online is citations. Any leads appreciated! >> > >> > -- >> > Royce > > > > -- > James D. (jj) Johnston > > Former Chief Scientist, Immersion Networks From crossd at gmail.com Tue Mar 11 11:14:20 2025 From: crossd at gmail.com (Dan Cross) Date: Mon, 10 Mar 2025 21:14:20 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <_eoCy4RgQnOHXyXj0k3-VEDDzQ2HTk7DiUCA8NAFq1ozXQtTbadAXjku2d4D0DwSOeTADMrawQPwSzze6u6b94cwkCfFs7uqFCsrq2c9aqI=@protonmail.com> References: <20250309131831.ol553bn3fo7jdcjb@illithid> <20250309194152.tjgibbix55nhuntg@illithid> <_eoCy4RgQnOHXyXj0k3-VEDDzQ2HTk7DiUCA8NAFq1ozXQtTbadAXjku2d4D0DwSOeTADMrawQPwSzze6u6b94cwkCfFs7uqFCsrq2c9aqI=@protonmail.com> Message-ID: On Mon, Mar 10, 2025 at 8:12 PM segaloco via TUHS wrote: > On Monday, March 10th, 2025 at 4:25 PM, Greg A. Woods wrote: > > ... > > > > Which reminds me, wasn't there was some mention of difficulties with > > Pascal et al w.r.t. separate compilation earlier in this thread.... > > Wasn't that one of the features of Berkeley Pascal? I know separate > > compilation is a definitely a feature by the time it makes it to > > SunOS/Solaris-2. I remember trying to figure it out once for a class > > assignment and the TA scowled at me and asked me why on earth I would > > ever want to do that, so I never trusted that TA on programming > > languages, or much of anything else, ever again. > > If by separate compilation you mean the compiler spits out (possibly optionally) > human-readable assembler, I would assume that is nearly crucial for any systems > language, otherwise you're putting a lot of faith in an ABI when it comes to > linking with your machdep assembly files no? I'd be nervous to use something > for bare metal programming that I couldn't easily inspect the resulting > assembler code from... No, typically that means that you've got two or more source files, say a.pas and b.pas; you separately compile those into object files, such as a.o and b.o, and then link them into a file executable binary. Early Pascal systems were not generally modular in that way, while C was. Yes, you are putting some faith into the linker and ABI, but on early Unix, they came from the same folks who wrote the compiler. Generating assembly listings (or disassembling object files/a,out) is a separate matter. - Dan C. From lm at mcvoy.com Tue Mar 11 11:35:02 2025 From: lm at mcvoy.com (Larry McVoy) Date: Mon, 10 Mar 2025 18:35:02 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: <20250311013502.GF24601@mcvoy.com> I'm guessing this is because yacc makes it easy to fiddle with the grammar? Does performance factor in? I'm actually curious because BitKeeper has what we call a dspec language which lets you wander through the revision history and print it out in a sort of awk/printf like language. If my memory serves me, we had a version in yacc (really flex but same thing) and Rob (cc-ed) rewrote it in a recursive-descent parser for performance reasons. If you are curious, this is a dspec that spits out history in JSON format that I wrote because one of my engineers said it was impossible (it wasn't): http://mcvoy.com/lm/bkdocs/dspec-changes-json-v.txt $0 .. $9 are variables. We used $if as a way to get an if statement rather than just say "if", :whatever: is a way to fish some field out of the history, Marc will get it, it's SCCS's :D: that means date, we just took it a lot further. I don't remember how much faster the RD version was but it was a lot, for sure more than a factor of 2 and maybe much more than that. All I remember is at some point the dspec parser was a performance issue and after Rob rewrote it, it wasn't. On Mon, Mar 10, 2025 at 05:06:13PM -0700, Ken Thompson wrote: > re yacc vs RD > > i agree that they are about the same, > where the edge would tilt based on the parsed language. > BUT when the parsed language (like go) is not yet defined, > yacc is the only option. > > > > On Mon, Mar 10, 2025 at 4:50???PM Clem Cole wrote: > > > Marc - check out OpenSIMH( https://opensimh.org) > > Check out over 40 different simulators including the I7000 which > > supports IBM 701,7010,7070,7080, 7090 - https://opensimh.org/simulators/ > > > > > > ??? > > > > On Mon, Mar 10, 2025 at 7:12???PM Marc Rochkind wrote: > > > >> This thread started to be about what I thought were system programming > >> languages (e.g., C, BLISS) and seems to have meandered into a general > >> discussion of languages that were around in the 1960s and 1970s, so, what > >> the heck, I'll add my own story. > >> > >> PL/0 is an education programming language introduced in the book, *Algorithms > >> + Data Structures = Programs*, by Niklaus Wirth in 1976. It's a great > >> language for teaching compiler writing because it contains interesting > >> concepts, such as recursive functions, yet isn't overly complicated. I > >> wrote a PL/0 compiler for the IBM 701 ( > >> https://github.com/MarcRochkind/pl0compiler). > >> > >> Yeah, that's not a misprint. I wrote perhaps the world's only 701 > >> emulator (https://www.mrochkind.com/mrochkind/a-701.html), and my PL/0 > >> compiler runs on it. Unfortunately, I can't verify that the compiled code > >> runs on an actual 701, since I'm sure there haven't been any in operation > >> for many decades. For those of you who haven't had the pleasure, > >> programming the 701 is really hard. It had no index registers, and the sign > >> bit didn't participate in shifts. Still, my compiler compiles full-blown > >> PL/0. > >> > >> So there! ;-) > >> > >> Marc Rochkind > >> > >> On Mon, Mar 10, 2025 at 2:49???PM Bakul Shah via TUHS > >> wrote: > >> > >>> Perhaps the interviewer was looking for something dumb like the following > >>> and not a full RD parser? > >>> > >>> int count = 0; > >>> while (*cp) { > >>> char c = *cp++; > >>> count += c == '(' ? 1 : c == ')' ? -1 : 0; > >>> if (count < 0) return -1; // FAIL: one too many ) > >>> } > >>> if (count > 0) return -1; // FAIL: too many ( > >>> return 0; // SUCCESS > >>> > >>> Though this will fall apart if you also want to also balance braces &/or > >>> brackets and must catch invalid cases like "(..[..)..]"! > >>> > >>> > On Mar 10, 2025, at 8:19???AM, John Cowan wrote: > >>> > > >>> > I was working at the whiteboard during a job interview once. I had > >>> been asked to write a function to report if its input had balanced > >>> parentheses. No problem: I wrote an RD parser in Python (which I prefer > >>> for whiteboarding) to detect balance and return True if the parse was > >>> successful and False if EOF was reached. > >>> > > >>> > I was starting to write some tests when the interviewer interrupted me. > >>> > > >>> > "What is that?" > >>> > > >>> > "It's a recursive descent parser. It detects if the input is > >>> well-formed." > >>> > > >>> > Blank look. > >>> > > >>> > I started to walk him through the code. > >>> > > >>> > He interrupted me. "Excuse me, I'll be back in a few minutes." > >>> > > >>> > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the > >>> recruiter will get back to you." That's the last I hear from them. > >>> > >>> > >> > >> -- > >> Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. > >> > > -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From johnl at taugh.com Tue Mar 11 12:18:43 2025 From: johnl at taugh.com (John Levine) Date: 10 Mar 2025 22:18:43 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309131831.ol553bn3fo7jdcjb@illithid> Message-ID: <20250311021844.256FDBF852FB@ary.qy> It appears that Greg A. Woods said: >I think one of the most loved innovations C (and B just before it) >brought to programming languages was the use of braces instead of >begin/end, along with the elimination of some of the other extra >unnecessary keywords like "then". ... I suspect that's because they came along just late enough that everything used ASCII, and they got those cool Model 37 Teletypes, so they could assume that devices could generate and print punctuation like { } [ ] | < > The C standards used to have (still have?) trigraph equivalents for much of the punctuation for people on systems that don't have the characters. R's, John From johnl at taugh.com Tue Mar 11 12:26:07 2025 From: johnl at taugh.com (John Levine) Date: 10 Mar 2025 22:26:07 -0400 Subject: [TUHS] uphill both ways, was Re: What would early alternatives to C have been? In-Reply-To: References: Message-ID: <20250311022607.9C13BBF8544C@ary.qy> It appears that Charles H. Sauer said: >In 1971 compiler construction course at UT-Austin, we had to use CDC Fortran to build a compiler for a subset of Algol 60. Undergraduates >used punch cards and RJE to the 6600/6400 shared by all but the privleged few. In my 1971 compiler course at Yale, Alan Perlis made us try to write a compiler that translated a subset of APL into Basic. He suggested we write it in APL, which was a terrible idea, so I wrote it in Trac, for which I happened to have written my own interpreter. I think my compiler was the only one that worked, and it was pretty clever, turning the APL array expressions into structures with array boundaries and example expressions, with no array temporaries. It only generated the loops to evaluate the expressions when storing into another array. Someone got a PhD in 1978 for a similar compiling technique but in 1971 I was a 17 year old twerp so what did I know? R's, John From g.branden.robinson at gmail.com Tue Mar 11 14:00:09 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Mon, 10 Mar 2025 23:00:09 -0500 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250311021844.256FDBF852FB@ary.qy> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309131831.ol553bn3fo7jdcjb@illithid> <20250311021844.256FDBF852FB@ary.qy> Message-ID: <20250311040009.ctldvxphbpxebxvn@illithid> At 2025-03-10T22:18:43-0400, John Levine wrote: > I suspect that's because they came along just late enough that > everything used ASCII, and they got those cool Model 37 Teletypes, so > they could assume that devices could generate and print punctuation > like { } [ ] | < > "begin" and "end" were fine. I've heard a story that Pascal was unpopular in the Middle East; every programming student wanted to know why they had to pay homage to Israeli Prime Minister Menachem Begin in every program, which tells you how old this story is. > The C standards used to have (still have?) trigraph equivalents for > much of the punctuation for people on systems that don't have the > characters. My understanding of the feedback procedure for ISO C23 finalization is that some NBs (national bodies) fought pretty hard to keep trigraphs. I _think_ they're gone from the final version of the standard, but I don't have an official version, just a "final" draft, and my impression of the WG14 meeting minutes from last year was that this particular issue was fought to the bitter end. So maybe they snuck back in at the 11th hour. Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From ggm at algebras.org Tue Mar 11 14:14:56 2025 From: ggm at algebras.org (George Michaelson) Date: Tue, 11 Mar 2025 14:14:56 +1000 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250311040009.ctldvxphbpxebxvn@illithid> References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309131831.ol553bn3fo7jdcjb@illithid> <20250311021844.256FDBF852FB@ary.qy> <20250311040009.ctldvxphbpxebxvn@illithid> Message-ID: Is nobody going to tell stories about the #define of Bourne Shell fame? -G On Tue, Mar 11, 2025 at 2:00 PM G. Branden Robinson < g.branden.robinson at gmail.com> wrote: > At 2025-03-10T22:18:43-0400, John Levine wrote: > > I suspect that's because they came along just late enough that > > everything used ASCII, and they got those cool Model 37 Teletypes, so > > they could assume that devices could generate and print punctuation > > like { } [ ] | < > > > "begin" and "end" were fine. > > I've heard a story that Pascal was unpopular in the Middle East; > every programming student wanted to know why they had to pay homage to > Israeli Prime Minister Menachem Begin in every program, which tells you > how old this story is. > > > The C standards used to have (still have?) trigraph equivalents for > > much of the punctuation for people on systems that don't have the > > characters. > > My understanding of the feedback procedure for ISO C23 finalization is > that some NBs (national bodies) fought pretty hard to keep trigraphs. > > I _think_ they're gone from the final version of the standard, but I > don't have an official version, just a "final" draft, and my impression > of the WG14 meeting minutes from last year was that this particular > issue was fought to the bitter end. > > So maybe they snuck back in at the 11th hour. > > Regards, > Branden > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenbob at gmail.com Tue Mar 11 15:07:54 2025 From: kenbob at gmail.com (Ken Thompson) Date: Mon, 10 Mar 2025 22:07:54 -0700 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: <20250311013502.GF24601@mcvoy.com> References: <20250311013502.GF24601@mcvoy.com> Message-ID: i find that SD and yacc have about the same time performance. yacc gets a bad rep when it uses lex as its front-end. then it is pig-slow. On Mon, Mar 10, 2025 at 6:52 PM Larry McVoy wrote: > I'm guessing this is because yacc makes it easy to fiddle with the grammar? > Does performance factor in? > > I'm actually curious because BitKeeper has what we call a dspec language > which lets you wander through the revision history and print it out in > a sort of awk/printf like language. If my memory serves me, we had a > version in yacc (really flex but same thing) and Rob (cc-ed) rewrote > it in a recursive-descent parser for performance reasons. If you are > curious, this is a dspec that spits out history in JSON format that > I wrote because one of my engineers said it was impossible (it wasn't): > > http://mcvoy.com/lm/bkdocs/dspec-changes-json-v.txt > > $0 .. $9 are variables. We used $if as a way to get an if statement > rather than just say "if", :whatever: is a way to fish some field out > of the history, Marc will get it, it's SCCS's :D: that means date, we > just took it a lot further. > > I don't remember how much faster the RD version was but it was a lot, > for sure more than a factor of 2 and maybe much more than that. All > I remember is at some point the dspec parser was a performance issue > and after Rob rewrote it, it wasn't. > > On Mon, Mar 10, 2025 at 05:06:13PM -0700, Ken Thompson wrote: > > re yacc vs RD > > > > i agree that they are about the same, > > where the edge would tilt based on the parsed language. > > BUT when the parsed language (like go) is not yet defined, > > yacc is the only option. > > > > > > > > On Mon, Mar 10, 2025 at 4:50???PM Clem Cole wrote: > > > > > Marc - check out OpenSIMH( https://opensimh.org) > > > Check out over 40 different simulators including the I7000 which > > > supports IBM 701,7010,7070,7080, 7090 - > https://opensimh.org/simulators/ > > > > > > > > > ??? > > > > > > On Mon, Mar 10, 2025 at 7:12???PM Marc Rochkind > wrote: > > > > > >> This thread started to be about what I thought were system programming > > >> languages (e.g., C, BLISS) and seems to have meandered into a general > > >> discussion of languages that were around in the 1960s and 1970s, so, > what > > >> the heck, I'll add my own story. > > >> > > >> PL/0 is an education programming language introduced in the book, > *Algorithms > > >> + Data Structures = Programs*, by Niklaus Wirth in 1976. It's a great > > >> language for teaching compiler writing because it contains interesting > > >> concepts, such as recursive functions, yet isn't overly complicated. I > > >> wrote a PL/0 compiler for the IBM 701 ( > > >> https://github.com/MarcRochkind/pl0compiler). > > >> > > >> Yeah, that's not a misprint. I wrote perhaps the world's only 701 > > >> emulator (https://www.mrochkind.com/mrochkind/a-701.html), and my > PL/0 > > >> compiler runs on it. Unfortunately, I can't verify that the compiled > code > > >> runs on an actual 701, since I'm sure there haven't been any in > operation > > >> for many decades. For those of you who haven't had the pleasure, > > >> programming the 701 is really hard. It had no index registers, and > the sign > > >> bit didn't participate in shifts. Still, my compiler compiles > full-blown > > >> PL/0. > > >> > > >> So there! ;-) > > >> > > >> Marc Rochkind > > >> > > >> On Mon, Mar 10, 2025 at 2:49???PM Bakul Shah via TUHS > > >> wrote: > > >> > > >>> Perhaps the interviewer was looking for something dumb like the > following > > >>> and not a full RD parser? > > >>> > > >>> int count = 0; > > >>> while (*cp) { > > >>> char c = *cp++; > > >>> count += c == '(' ? 1 : c == ')' ? -1 : 0; > > >>> if (count < 0) return -1; // FAIL: one too many ) > > >>> } > > >>> if (count > 0) return -1; // FAIL: too many ( > > >>> return 0; // SUCCESS > > >>> > > >>> Though this will fall apart if you also want to also balance braces > &/or > > >>> brackets and must catch invalid cases like "(..[..)..]"! > > >>> > > >>> > On Mar 10, 2025, at 8:19???AM, John Cowan wrote: > > >>> > > > >>> > I was working at the whiteboard during a job interview once. I had > > >>> been asked to write a function to report if its input had balanced > > >>> parentheses. No problem: I wrote an RD parser in Python (which I > prefer > > >>> for whiteboarding) to detect balance and return True if the parse was > > >>> successful and False if EOF was reached. > > >>> > > > >>> > I was starting to write some tests when the interviewer > interrupted me. > > >>> > > > >>> > "What is that?" > > >>> > > > >>> > "It's a recursive descent parser. It detects if the input is > > >>> well-formed." > > >>> > > > >>> > Blank look. > > >>> > > > >>> > I started to walk him through the code. > > >>> > > > >>> > He interrupted me. "Excuse me, I'll be back in a few minutes." > > >>> > > > >>> > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, > the > > >>> recruiter will get back to you." That's the last I hear from them. > > >>> > > >>> > > >> > > >> -- > > >> Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. > > >> > > > > > -- > --- > Larry McVoy Retired to fishing > http://www.mcvoy.com/lm/boat > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cowan at ccil.org Tue Mar 11 15:15:49 2025 From: cowan at ccil.org (John Cowan) Date: Tue, 11 Mar 2025 01:15:49 -0400 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: Message-ID: Very probably. But under pressure I went with what I knew I could get right. In hindsight it was like one of those non-quantitative multiple-choice tests where you have to figure out not the correct answer but the answer that the test author is most likely to believe to be correct. On Mon, Mar 10, 2025, 4:49 PM Bakul Shah wrote: > Perhaps the interviewer was looking for something dumb like the following > and not a full RD parser? > > int count = 0; > while (*cp) { > char c = *cp++; > count += c == '(' ? 1 : c == ')' ? -1 : 0; > if (count < 0) return -1; // FAIL: one too many ) > } > if (count > 0) return -1; // FAIL: too many ( > return 0; // SUCCESS > > Though this will fall apart if you also want to also balance braces &/or > brackets and must catch invalid cases like "(..[..)..]"! > > > On Mar 10, 2025, at 8:19 AM, John Cowan wrote: > > > > I was working at the whiteboard during a job interview once. I had been > asked to write a function to report if its input had balanced parentheses. > No problem: I wrote an RD parser in Python (which I prefer for > whiteboarding) to detect balance and return True if the parse was > successful and False if EOF was reached. > > > > I was starting to write some tests when the interviewer interrupted me. > > > > "What is that?" > > > > "It's a recursive descent parser. It detects if the input is > well-formed." > > > > Blank look. > > > > I started to walk him through the code. > > > > He interrupted me. "Excuse me, I'll be back in a few minutes." > > > > Long wait, maybe 15-20 minutes. Someone else comes in. "Thank you, the > recruiter will get back to you." That's the last I hear from them. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From f4grx at f4grx.net Tue Mar 11 20:21:47 2025 From: f4grx at f4grx.net (Sebastien F4GRX) Date: Tue, 11 Mar 2025 11:21:47 +0100 Subject: [TUHS] DCJ-11 processor with 20k FPGA In-Reply-To: <3da18b77-d02d-4916-b272-2d2c925eec2c@e-bbes.com> References: <3da18b77-d02d-4916-b272-2d2c925eec2c@e-bbes.com> Message-ID: <69b2d02f-22fc-4c7c-b577-8d4612db686d@f4grx.net> Hello, Would that be it? https://github.com/ryomuk/TangNanoDCJ11MEM/tree/main/applications/unix-v1/TangNanoDCJ11MEM_project.20240729.beta/src I have not found synthesis toolchain info. All the doc is written in japanese with no way to get proper english info except via ai translation. Sebastien On 05/03/2025 15:50, emanuel stiebler wrote: > On 2025-03-01 07:11, Jason Stevens via TUHS wrote: >> I assume people have seen this? >> >> https://github.com/ryomuk/TangNanoDCJ11MEM/tree/main >> >> >> It's capable of running Unix v1 & some limited amount of v6 among other >> things.   The FPGA in question the Tang Nano 20k is sub 30GBP >> delivered from >> AliExpress. >> >> Kind of neat to combine a real processor with a simple FPGA >> implementation >> of the hardware. > > I just had a look at it, but he doesn't show the code, which runs on > the TangNano? From trnsz at pobox.com Wed Mar 12 00:47:01 2025 From: trnsz at pobox.com (Jeff Johnson) Date: Tue, 11 Mar 2025 10:47:01 -0400 Subject: [TUHS] "Webster's Second on the Head of a Pin"? In-Reply-To: References: Message-ID: <2152f633-b9c0-43a7-953e-3a4260b98b77@app.fastmail.com> As far as getting maximum compression from lzip, `lzip -9 -s 512Mi` is usually the preferred invocation. I think the default dictionary size at -9 is 64MiB, but that's not that relevant I guess, unless we are comparing different methods and trying to see about what ratios different algorithms get at similar memory usage. -- Jeffrey H. Johnson trnsz at pobox.com On Mon, Mar 10, 2025, at 8:23 PM, Douglas McIlroy wrote: > Gnu lzip -9 got a 3.92 compression factor , while Morris-Thompson got 4.52. -- Jeffrey H. Johnson trnsz at pobox.com On Mon, Mar 10, 2025, at 8:23 PM, Douglas McIlroy wrote: > Gnu lzip -9 got a 3.92 compression factor , while Morris-Thompson got 4.52. From ron at ronnatalie.com Wed Mar 12 01:18:57 2025 From: ron at ronnatalie.com (Ron Natalie) Date: Tue, 11 Mar 2025 15:18:57 +0000 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309131831.ol553bn3fo7jdcjb@illithid> <20250311021844.256FDBF852FB@ary.qy> <20250311040009.ctldvxphbpxebxvn@illithid> Message-ID: Gosh, I had the misfortune of working on the shell (and I think ADB) when they had those darned marcros. I was so happy when 5R2 came around and I found someone had undone them all and turned it back into normal C. ------ Original Message ------ >From "George Michaelson" To tuhs at tuhs.org Date 3/11/2025 12:14:56 AM Subject [TUHS] Re: What would early alternatives to C have been? >Is nobody going to tell stories about the #define of Bourne Shell fame? > >-G -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Wed Mar 12 05:24:24 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Tue, 11 Mar 2025 12:24:24 -0700 Subject: [TUHS] uphill both ways, was Re: What would early alternatives to C have been? Message-ID: <00F9A32D-C233-4458-B2D4-E0994C5EAC9F@iitbombay.org>  On Mar 10, 2025, at 7:26 PM, John Levine wrote: > > In my 1971 compiler course at Yale, Alan Perlis made us try to write a compiler > that translated a subset of APL into Basic. He suggested we write it in APL, > which was a terrible idea, so I wrote it in Trac, for which I happened to have > written my own interpreter. > > I think my compiler was the only one that worked, and it was pretty clever, > turning the APL array expressions into structures with array boundaries and > example expressions, with no array temporaries. It only generated the loops to > evaluate the expressions when storing into another array. > > Someone got a PhD in 1978 for a similar compiling technique but in 1971 I was a > 17 year old twerp so what did I know? > > R's, > John Pretty impressive for a 17yo! Isn’t APL syntax rather context sensitive[1]? Neither yacc nor a RD parser would’ve helped! Unless the subset was limited to a context free subset. Tim Budd in his 1978 work made quite a few changes to APL to ease compilation and used yacc. [I have the book somewhere....] [1] I do not recall if Iverson's original APL had a context sensitive grammar but modern APLs do. Given an expression ‘x y z’, its parse depends on the types of x, y & z. Example: y(x,z) if y is a dyadic verb, x & z are values, x(y(z)) if x & y are monadic verbs, z a value etc. From johnl at taugh.com Wed Mar 12 06:11:52 2025 From: johnl at taugh.com (John R Levine) Date: 11 Mar 2025 16:11:52 -0400 Subject: [TUHS] uphill both ways, was Re: What would early alternatives to C have been? In-Reply-To: <00F9A32D-C233-4458-B2D4-E0994C5EAC9F@iitbombay.org> References: <00F9A32D-C233-4458-B2D4-E0994C5EAC9F@iitbombay.org> Message-ID: <5b269f9a-7241-f149-8910-ab576d147854@taugh.com> On Tue, 11 Mar 2025, Bakul Shah wrote: >> I think my compiler was the only one that worked, and it was pretty clever, >> turning the APL array expressions into structures with array boundaries and >> example expressions, with no array temporaries. It only generated the loops to >> evaluate the expressions when storing into another array. >> >> Someone got a PhD in 1978 for a similar compiling technique but in 1971 I was a >> 17 year old twerp so what did I know? >> >> R's, >> John > > Pretty impressive for a 17yo! > > Isn’t APL syntax rather context sensitive[1]? Neither yacc nor > a RD parser would’ve helped! Unless the subset was limited to a > context free subset. APL at the time had a very simple syntax, once you got past all the special characters. I wrote my compiler on a PDP-10 with ASCII terminals so I just used keywords like IOTA. > Tim Budd in his 1978 work made quite a few changes to APL to > ease compilation and used yacc. [I have the book somewhere....] There's a copy at the Internet Archive. He and I were at Yale at the same time and we knew each other a little, but neither of us was aware that we'd come up with such similar techniques. > Given an expression ‘x y z’, its parse depends on the types of > x, y & z. Example: y(x,z) if y is a dyadic verb, x & z are values, > x(y(z)) if x & y are monadic verbs, z a value etc. You do need to know whether a symbol is a variable or some kind of function, but that's not hard. Regards, John Levine, johnl at taugh.com, Taughannock Networks, Trumansburg NY Please consider the environment before reading this e-mail. https://jl.ly From robpike at gmail.com Wed Mar 12 07:52:12 2025 From: robpike at gmail.com (Rob Pike) Date: Wed, 12 Mar 2025 08:52:12 +1100 Subject: [TUHS] What would early alternatives to C have been? In-Reply-To: References: <20250309194152.tjgibbix55nhuntg@illithid> <20250309131831.ol553bn3fo7jdcjb@illithid> <20250311021844.256FDBF852FB@ary.qy> <20250311040009.ctldvxphbpxebxvn@illithid> Message-ID: The rapscallion who removed the macros, at least for the version I worked on for the v8 shell (sadly missed) was called Steve Bourne. -rob On Wed, Mar 12, 2025 at 2:19 AM Ron Natalie wrote: > > Gosh, I had the misfortune of working on the shell (and I think ADB) when they had those darned marcros. I was so happy when 5R2 came around and I found someone had undone them all and turned it back into normal C. > > > > ------ Original Message ------ > From "George Michaelson" > To tuhs at tuhs.org > Date 3/11/2025 12:14:56 AM > Subject [TUHS] Re: What would early alternatives to C have been? > > Is nobody going to tell stories about the #define of Bourne Shell fame? > > -G From johnl at taugh.com Wed Mar 12 11:36:09 2025 From: johnl at taugh.com (John Levine) Date: 12 Mar 2025 01:36:09 -0000 Subject: [TUHS] parsing tools, was What would early alternatives In-Reply-To: References: Message-ID: According to Ken Thompson : >-=-=-=-=-=- > >i find that SD and yacc have about the same >time performance. yacc gets a bad rep when >it uses lex as its front-end. then it is pig-slow. Parsing is never a performance issue since its time is generally O(n) where n is the number of tokens emitted by the lexer and the constant is not large. The lexer always is because it's the only part of the compiler that has to look at every input character separately. If a compiler does a lot of optimization, the analysis for that can be slow, too. Lex was a dog. Flex, which as far as I know was a complete rewrite that shared no code with lex, are not bad. -- Regards, John Levine, johnl at taugh.com, Primary Perpetrator of "The Internet for Dummies", Please consider the environment before reading this e-mail. https://jl.ly From tytso at mit.edu Wed Mar 12 12:19:32 2025 From: tytso at mit.edu (Theodore Ts'o) Date: Tue, 11 Mar 2025 22:19:32 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? Message-ID: <20250312021932.GA367338@mit.edu> As part of a discusion on the Linux kernel mailing list, there was an assertion that ctime was orginally "creation time". >From the v7 sources in TUHS, we can see: struct dinode { unsigned short di_mode; /* mode and type of file */ short di_nlink; /* number of links to file */ short di_uid; /* owner's user id */ short di_gid; /* owner's group id */ off_t di_size; /* number of bytes in file */ char di_addr[40]; /* disk block addresses */ time_t di_atime; /* time last accessed */ time_t di_mtime; /* time last modified */ time_t di_ctime; /* time created */ }; ... although the v7 kernel sources does seem to update ctime when the inode metadata changes, regardless of what the coment in /usr/src/sys/h/ino.h might say. More interestingly, this comment seems to continue in newer versions up to 3BSD, and then the comments becomes "change time" in BSD 4.2, probably coincident with the File System Implementation? The best we can guess is that the change from "creation time" to "inode change time" happened sometime between 1979 and 1982. Does anyone who was around can give the story about how and when this happened? - Ted From rich.salz at gmail.com Wed Mar 12 12:22:00 2025 From: rich.salz at gmail.com (Rich Salz) Date: Tue, 11 Mar 2025 22:22:00 -0400 Subject: [TUHS] parsing tools, was What would early alternatives In-Reply-To: References: Message-ID: > > Lex was a dog. Flex, which as far as I know was a complete rewrite > that shared no code with lex, are not bad. > Yes, "it's true what they say about lex" was a common phrase. (I think I first saw it in the pathalias paper.) There was a Usenix presentation -- late 80's (Washington?) but I couldn't find it. -- by Vern Paxson about his rewrite called flex. I think he said that the main reason lex was slow was the clever folding it had to do of its internal tables to fit. Sadly Vern never got his paper in, but the presentation showed a group with three lines, one at the top being slow lex, the next two much lower on the Y axis being very close. He said the second line was flex. Someone said what about the third, fastest, line? His answer "that's cat" Vern worked with Van on early network improvements (web-search the both of them for a fun time). Flex first posted to Usenet in 1988, https://groups.google.com/g/comp.sources.unix/c/LZ_EHqd7XBg -------------- next part -------------- An HTML attachment was scrubbed... URL: From robpike at gmail.com Wed Mar 12 12:31:37 2025 From: robpike at gmail.com (Rob Pike) Date: Wed, 12 Mar 2025 13:31:37 +1100 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: Or perhaps the comment was wrong? I do remember being confused by it. -rob On Wed, Mar 12, 2025 at 1:19 PM Theodore Ts'o wrote: > As part of a discusion on the Linux kernel mailing list, there was an > assertion that ctime was orginally "creation time". > > From the v7 sources in TUHS, we can see: > > struct dinode > { > unsigned short di_mode; /* mode and type of file */ > short di_nlink; /* number of links to file */ > short di_uid; /* owner's user id */ > short di_gid; /* owner's group id */ > off_t di_size; /* number of bytes in file */ > char di_addr[40]; /* disk block addresses */ > time_t di_atime; /* time last accessed */ > time_t di_mtime; /* time last modified */ > time_t di_ctime; /* time created */ > }; > > ... although the v7 kernel sources does seem to update ctime when the > inode metadata changes, regardless of what the coment in > /usr/src/sys/h/ino.h might say. > > More interestingly, this comment seems to continue in newer versions > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > probably coincident with the File System Implementation? > > The best we can guess is that the change from "creation time" to > "inode change time" happened sometime between 1979 and 1982. Does > anyone who was around can give the story about how and when this > happened? > > - Ted > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dave at horsfall.org Wed Mar 12 12:33:14 2025 From: dave at horsfall.org (Dave Horsfall) Date: Wed, 12 Mar 2025 13:33:14 +1100 (AEDT) Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: On Tue, 11 Mar 2025, Theodore Ts'o wrote: > As part of a discusion on the Linux kernel mailing list, there was an > assertion that ctime was orginally "creation time". [...] It's been "inode change time" for as long as I can remember (and that's since the 70s); I have no idea what the Penguins are on about, but I've never given them much credence anyway. -- Dave From tytso at mit.edu Wed Mar 12 12:46:54 2025 From: tytso at mit.edu (Theodore Ts'o) Date: Tue, 11 Mar 2025 22:46:54 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: <20250312024654.GC132466@mit.edu> On Wed, Mar 12, 2025 at 01:33:14PM +1100, Dave Horsfall wrote: > On Tue, 11 Mar 2025, Theodore Ts'o wrote: > > > As part of a discusion on the Linux kernel mailing list, there was an > > assertion that ctime was orginally "creation time". > > [...] > > It's been "inode change time" for as long as I can remember (and that's > since the 70s); I have no idea what the Penguins are on about, but I've > never given them much credence anyway. OK, how much credence would you give to Ritchie and Thompson writing in The Bell System Technical Journal, Vol. 57, No. 6, July-August 1978, page 1914[1], where they state that the i-node contains: (v) time of creation, last use, and last modification ^^^^^^^^^^^^^^^^ [1] https://www.tuhs.org/Archive/Documentation/Papers/BSTJ/bstj57-6-1905.pdf There are some of us Penguins who actually cared enough to do some research using the TUHS archives... :-) - Ted From lm at mcvoy.com Wed Mar 12 12:49:03 2025 From: lm at mcvoy.com (Larry McVoy) Date: Tue, 11 Mar 2025 19:49:03 -0700 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: <20250312024902.GH24601@mcvoy.com> I had the exact same reaction, I think I saw the time created and liked that semantic. When it morphed into inode change time, as a source management guy, I'd much rather have create time than changed time since I already got access and modified times. On Wed, Mar 12, 2025 at 01:31:37PM +1100, Rob Pike wrote: > Or perhaps the comment was wrong? > > I do remember being confused by it. > > -rob > > > On Wed, Mar 12, 2025 at 1:19???PM Theodore Ts'o wrote: > > > As part of a discusion on the Linux kernel mailing list, there was an > > assertion that ctime was orginally "creation time". > > > > From the v7 sources in TUHS, we can see: > > > > struct dinode > > { > > unsigned short di_mode; /* mode and type of file */ > > short di_nlink; /* number of links to file */ > > short di_uid; /* owner's user id */ > > short di_gid; /* owner's group id */ > > off_t di_size; /* number of bytes in file */ > > char di_addr[40]; /* disk block addresses */ > > time_t di_atime; /* time last accessed */ > > time_t di_mtime; /* time last modified */ > > time_t di_ctime; /* time created */ > > }; > > > > ... although the v7 kernel sources does seem to update ctime when the > > inode metadata changes, regardless of what the coment in > > /usr/src/sys/h/ino.h might say. > > > > More interestingly, this comment seems to continue in newer versions > > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > > probably coincident with the File System Implementation? > > > > The best we can guess is that the change from "creation time" to > > "inode change time" happened sometime between 1979 and 1982. Does > > anyone who was around can give the story about how and when this > > happened? > > > > - Ted > > -- --- Larry McVoy Retired to fishing http://www.mcvoy.com/lm/boat From rik at rikfarrow.com Wed Mar 12 13:02:19 2025 From: rik at rikfarrow.com (Rik Farrow) Date: Tue, 11 Mar 2025 20:02:19 -0700 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312024902.GH24601@mcvoy.com> References: <20250312021932.GA367338@mit.edu> <20250312024902.GH24601@mcvoy.com> Message-ID: As someone interested in security, I liked the concept of 'inode change time'. Creation time is certainly a change, so that fits too. Changing permissions or overwriting of binaries with trojans was a popular hacking technique in the 90s. Trojan installation scripts would often contain commands (touch) to replace the inode modify times, but that changed the inode change times too, making ctime a more reliable indicator of tampering. The workaround for that was to change the system clock to the desired ctime, modify the binary, then change the system clock back. Fortunately, setting system clocks back arbitrarily no longer works in systems I've used. Perhaps someone knows more about this change? Rik On Tue, Mar 11, 2025 at 7:49 PM Larry McVoy wrote: > I had the exact same reaction, I think I saw the time created and liked > that semantic. When it morphed into inode change time, as a source > management guy, I'd much rather have create time than changed time > since I already got access and modified times. > > On Wed, Mar 12, 2025 at 01:31:37PM +1100, Rob Pike wrote: > > Or perhaps the comment was wrong? > > > > I do remember being confused by it. > > > > -rob > > > > > > On Wed, Mar 12, 2025 at 1:19???PM Theodore Ts'o wrote: > > > > > As part of a discusion on the Linux kernel mailing list, there was an > > > assertion that ctime was orginally "creation time". > > > > > > From the v7 sources in TUHS, we can see: > > > > > > struct dinode > > > { > > > unsigned short di_mode; /* mode and type of file */ > > > short di_nlink; /* number of links to file */ > > > short di_uid; /* owner's user id */ > > > short di_gid; /* owner's group id */ > > > off_t di_size; /* number of bytes in file */ > > > char di_addr[40]; /* disk block addresses */ > > > time_t di_atime; /* time last accessed */ > > > time_t di_mtime; /* time last modified */ > > > time_t di_ctime; /* time created */ > > > }; > > > > > > ... although the v7 kernel sources does seem to update ctime when the > > > inode metadata changes, regardless of what the coment in > > > /usr/src/sys/h/ino.h might say. > > > > > > More interestingly, this comment seems to continue in newer versions > > > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > > > probably coincident with the File System Implementation? > > > > > > The best we can guess is that the change from "creation time" to > > > "inode change time" happened sometime between 1979 and 1982. Does > > > anyone who was around can give the story about how and when this > > > happened? > > > > > > - Ted > > > > > -- > --- > Larry McVoy Retired to fishing > http://www.mcvoy.com/lm/boat > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lm at mcvoy.com Wed Mar 12 13:35:17 2025 From: lm at mcvoy.com (Larry McVoy) Date: Tue, 11 Mar 2025 20:35:17 -0700 Subject: [TUHS] parsing tools, was What would early alternatives In-Reply-To: References: Message-ID: <20250312033517.GK24601@mcvoy.com> On Tue, Mar 11, 2025 at 10:22:00PM -0400, Rich Salz wrote: > Vern worked with Van on early network improvements (web-search the both of > them for a fun time). Van was my networking hero until Drew at Netfix did the two side by side stacks. I'm not sure who is better, but I'm sure they are both better than me and I don't suck. I agree with Rich, check Van out, check Drew out. From imp at bsdimp.com Wed Mar 12 13:38:39 2025 From: imp at bsdimp.com (Warner Losh) Date: Tue, 11 Mar 2025 21:38:39 -0600 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: So I think I have to disagree about what it means in v7, comments notwithstanding. iupdat() has: if(ip->i_flag&ICHG) dp->di_ctime = time; ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few others, this was a quick peek). So even in v7, ctime means any changes to metadata / inode data. At least for some definition of "any". Btw, it looks like this was named IUPD in v6 and did similar things. In v6 it is updated in iput when IUPD is set. But it looks like ctime and mtime are folded together. So v7 is the first edition that has ctime and the code implements any change to metadata from my reading. Warner On Tue, Mar 11, 2025 at 8:19 PM Theodore Ts'o wrote: > As part of a discusion on the Linux kernel mailing list, there was an > assertion that ctime was orginally "creation time". > > From the v7 sources in TUHS, we can see: > > struct dinode > { > unsigned short di_mode; /* mode and type of file */ > short di_nlink; /* number of links to file */ > short di_uid; /* owner's user id */ > short di_gid; /* owner's group id */ > off_t di_size; /* number of bytes in file */ > char di_addr[40]; /* disk block addresses */ > time_t di_atime; /* time last accessed */ > time_t di_mtime; /* time last modified */ > time_t di_ctime; /* time created */ > }; > > ... although the v7 kernel sources does seem to update ctime when the > inode metadata changes, regardless of what the coment in > /usr/src/sys/h/ino.h might say. > > More interestingly, this comment seems to continue in newer versions > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > probably coincident with the File System Implementation? > > The best we can guess is that the change from "creation time" to > "inode change time" happened sometime between 1979 and 1982. Does > anyone who was around can give the story about how and when this > happened? > > - Ted > -------------- next part -------------- An HTML attachment was scrubbed... URL: From woods at robohack.ca Wed Mar 12 15:11:03 2025 From: woods at robohack.ca (Greg A. Woods) Date: Tue, 11 Mar 2025 22:11:03 -0700 Subject: [TUHS] parsing tools, was What would early alternatives In-Reply-To: References: Message-ID: At 12 Mar 2025 01:36:09 -0000, "John Levine" wrote: Subject: [TUHS] Re: parsing tools, was What would early alternatives > > Lex was a dog. Flex, which as far as I know was a complete rewrite > that shared no code with lex, are not bad. I've always understood the 'F' in Flex stood for "Fast". In fact it says as much right in the title of the manual page (right from the first post to Usenet): flex - fast lexical analyzer generator -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From alec.muffett at gmail.com Wed Mar 12 17:10:25 2025 From: alec.muffett at gmail.com (Alec Muffett) Date: Wed, 12 Mar 2025 07:10:25 +0000 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: On Wed, 12 Mar 2025, 05:57 Theodore Ts'o, wrote: > > time_t di_ctime; /* time created */ ... > ... although the v7 kernel sources does seem to update ctime when the > inode metadata changes, regardless of what the coment in > /usr/src/sys/h/ino.h might say. > I've worked at one company where policy was that comments were considered a risk for precisely this reason, and where if code were not written clearly enough to be understood without comments, it was generally considered a bug. Perhaps the comment is just a brain fart? -a -------------- next part -------------- An HTML attachment was scrubbed... URL: From tytso at mit.edu Wed Mar 12 21:42:50 2025 From: tytso at mit.edu (Theodore Ts'o) Date: Wed, 12 Mar 2025 07:42:50 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: <20250312114250.GF8837@mit.edu> On Tue, Mar 11, 2025 at 09:38:39PM -0600, Warner Losh wrote: > So I think I have to disagree about what it means in v7, comments > notwithstanding. > > iupdat() has: > if(ip->i_flag&ICHG) > dp->di_ctime = time; > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few others, > this was a quick peek). > > So even in v7, ctime means any changes to metadata / inode data. At least > for some definition of "any". > > Btw, it looks like this was named IUPD in v6 and did similar things. In v6 > it is updated in iput when IUPD is set. But it looks like ctime and mtime > are folded together. So v7 is the first edition that has ctime and the code > implements any change to metadata from my reading. Yeah, Al Viro (the chief Pengiun VFS maintainer) had made a similar observation to yours. After I pointed out the Ritchie and Thompson Unix paper, Al also pointed out that comment could be found in the filsys.5 man page[1] dating from January 1979, also says "creation time" although that is an excerpt from the ino.h header file, the comment of which, as I said, persisted into 2BSD and 3BSD. [1] https://www.tuhs.org/cgi-bin/utree.pl?file=2.9BSD/usr/man/man5/filsys.5 So this is why we were wondering whether if there was sometime when it actually *was* creation time, perhaps in between the v6 and v7 releases, or whether it was always just a comment brain fart, which also managed to make it into a published paper and the filsys.5 man page, with no one caring enough to fix it until several years went by, and perhaps only at Berkeley. (I'm curious when if ever it got fixed in the AT&T sources, but I haven't gone and checked those sources yet.) It's because we were genuinely interested that I decided to ask on TUHS list. Some of us *do* care about the Unix history. :-) - Ted From robpike at gmail.com Wed Mar 12 21:46:39 2025 From: robpike at gmail.com (Rob Pike) Date: Wed, 12 Mar 2025 22:46:39 +1100 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312114250.GF8837@mit.edu> References: <20250312021932.GA367338@mit.edu> <20250312114250.GF8837@mit.edu> Message-ID: Well, it's creation time right after the file is created. I mean that's kind of a joke, but also possibly an explanation for the comment, if you consider the mindset of the person writing the code at the time. Then they worked on the code some more and things moved along. In short, it could well be an honest, original mistake. -rob On Wed, Mar 12, 2025 at 10:43 PM Theodore Ts'o wrote: > On Tue, Mar 11, 2025 at 09:38:39PM -0600, Warner Losh wrote: > > So I think I have to disagree about what it means in v7, comments > > notwithstanding. > > > > iupdat() has: > > if(ip->i_flag&ICHG) > > dp->di_ctime = time; > > > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few > others, > > this was a quick peek). > > > > So even in v7, ctime means any changes to metadata / inode data. At least > > for some definition of "any". > > > > Btw, it looks like this was named IUPD in v6 and did similar things. In > v6 > > it is updated in iput when IUPD is set. But it looks like ctime and mtime > > are folded together. So v7 is the first edition that has ctime and the > code > > implements any change to metadata from my reading. > > Yeah, Al Viro (the chief Pengiun VFS maintainer) had made a similar > observation to yours. After I pointed out the Ritchie and Thompson > Unix paper, Al also pointed out that comment could be found in the > filsys.5 man page[1] dating from January 1979, also says "creation > time" although that is an excerpt from the ino.h header file, the > comment of which, as I said, persisted into 2BSD and 3BSD. > > [1] > https://www.tuhs.org/cgi-bin/utree.pl?file=2.9BSD/usr/man/man5/filsys.5 > > So this is why we were wondering whether if there was sometime when it > actually *was* creation time, perhaps in between the v6 and v7 > releases, or whether it was always just a comment brain fart, which > also managed to make it into a published paper and the filsys.5 man > page, with no one caring enough to fix it until several years went by, > and perhaps only at Berkeley. (I'm curious when if ever it got fixed > in the AT&T sources, but I haven't gone and checked those sources > yet.) > > It's because we were genuinely interested that I decided to ask on > TUHS list. Some of us *do* care about the Unix history. :-) > > - Ted > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tytso at mit.edu Wed Mar 12 22:07:19 2025 From: tytso at mit.edu (Theodore Ts'o) Date: Wed, 12 Mar 2025 08:07:19 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> <20250312024902.GH24601@mcvoy.com> Message-ID: <20250312120719.GG8837@mit.edu> On Tue, Mar 11, 2025 at 08:02:19PM -0700, Rik Farrow wrote: > As someone interested in security, I liked the concept of 'inode change > time'. Creation time is certainly a change, so that fits too. > > Changing permissions or overwriting of binaries with trojans was a popular > hacking technique in the 90s. Trojan installation scripts would often > contain commands (touch) to replace the inode modify times, but that > changed the inode change times too, making ctime a more reliable indicator > of tampering. I have fond memories of looking at the Larn (or was it Hack/Nethack?) sources and seeing that it checked ctime to prevent people from cheating by saving and restoring the game save file, since ctime couldn't be set without either changing the system clock or directly accessing the file system (or kernel memory) and messing with the on-disk inode directly. > The workaround for that was to change the system clock to the desired > ctime, modify the binary, then change the system clock back. Fortunately, > setting system clocks back arbitrarily no longer works in systems I've > used. Perhaps someone knows more about this change? Multics doesn't allow you to set the the clock back, since they use accurate time as part of their file system correctness semantics. It messes with Multics' unique_id abstraction, with their file system depends on heavily. Setting system clocks arbitrarily *does* work on most Unix/Linux systems I've used, although it can cause strange results[1], and you might have to disable the time daemon first. NSA's SELinux can also prevent you from setting the system clock, so you might have to disable it if it is active and you want to play with system time. [1] There are some hueristics that fsck.ext4 uses which can get fooled if time goes backwards, but they are much more minor compared with what could happen with Multics. > On Tue, Mar 11, 2025 at 7:49 PM Larry McVoy wrote: > > > I had the exact same reaction, I think I saw the time created and liked > > that semantic. When it morphed into inode change time, as a source > > management guy, I'd much rather have create time than changed time > > since I already got access and modified times. Yeah, that was actually the cause of the thread on the Linux Kernel mailing list. There are some people who want Linux to have a "creation time" which works like the Windows time stamp, which can be arbitrarily set by an unprivileged process. The arguments are the ones you've made; in addition, some people want to be able to have a date which survives being copied from one backup file system to another, so they can know when a photo or video was created without having to access the in-line metadata. What Linux currently has a "birth time" field which can't be changed by a utimes(2)-like system call. The rationale for that is this is what XFS had implemented back in the SGI days, and xfs had support for "birth time" with SGI semantics. The argument for that approach is that it can be a security thing, which is useful for people doing forensics analysis. I also wonder if this was used in the construction of NFS file handles (which for other OS's might be done via the i_version or i_generation fields, although the arguments about when *those* fields should be incremented are interesting and there isn't much agreement about that). >From there was someone who argued that Unix had originally had ctime as "creation time", and that ctime was "useless" (by which, that person was arguing that it was useless for *him*). I then pointed out that while there may have been comments going back to V7 days saying "creation time", even the comment was changed to "change time" by 1972 in the BSD 4.3 sources, and we weren't going to change ctime semantics that had been settled for over four decades just because someone wanted Windows compatibility. If the answer is "We have always been at war with Oceania" that would be interesting; which is why I wanted to ask what the history behind the file comment might have been. Was the comment and the Ritchie and Thompson paper *ever* correct on this point? Cheers, - Ted From henry.r.bent at gmail.com Wed Mar 12 22:09:23 2025 From: henry.r.bent at gmail.com (Henry Bent) Date: Wed, 12 Mar 2025 08:09:23 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312114250.GF8837@mit.edu> References: <20250312021932.GA367338@mit.edu> <20250312114250.GF8837@mit.edu> Message-ID: On Wed, 12 Mar 2025 at 07:43, Theodore Ts'o wrote: > On Tue, Mar 11, 2025 at 09:38:39PM -0600, Warner Losh wrote: > > So I think I have to disagree about what it means in v7, comments > > notwithstanding. > > > > iupdat() has: > > if(ip->i_flag&ICHG) > > dp->di_ctime = time; > > > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few > others, > > this was a quick peek). > > > > So even in v7, ctime means any changes to metadata / inode data. At least > > for some definition of "any". > > > > Btw, it looks like this was named IUPD in v6 and did similar things. In > v6 > > it is updated in iput when IUPD is set. But it looks like ctime and mtime > > are folded together. So v7 is the first edition that has ctime and the > code > > implements any change to metadata from my reading. > > Yeah, Al Viro (the chief Pengiun VFS maintainer) had made a similar > observation to yours. After I pointed out the Ritchie and Thompson > Unix paper, Al also pointed out that comment could be found in the > filsys.5 man page[1] dating from January 1979, also says "creation > time" although that is an excerpt from the ino.h header file, the > comment of which, as I said, persisted into 2BSD and 3BSD. > > [1] > https://www.tuhs.org/cgi-bin/utree.pl?file=2.9BSD/usr/man/man5/filsys.5 > > So this is why we were wondering whether if there was sometime when it > actually *was* creation time, perhaps in between the v6 and v7 > releases, or whether it was always just a comment brain fart, which > also managed to make it into a published paper and the filsys.5 man > page, with no one caring enough to fix it until several years went by, > and perhaps only at Berkeley. (I'm curious when if ever it got fixed > in the AT&T sources, but I haven't gone and checked those sources > yet.) The comment in the source had not been changed as of SVR4; however between SVR1 and SVR2 the manpage changed. For example, from looking at the sources on the CSRG DVD: S5_1/usr/man/u_man/man4/inode.4: time_t di_ctime; /\(** time created \(**/ S5_2.0/usr/man/p_man/man4/inode.4: time_t di_ctime; /\(** time of last file status change \(**/ and then again in SVR4: S5_4.0/doc/MANPAGES/4/gen/inode_s5: time_t di_ctime; /\(** time status last changed \(**/ All of the later Research versions (v[8-10]) have time_t di_ctime; /* time created */ in that struct. -Henry -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglas.mcilroy at dartmouth.edu Wed Mar 12 22:41:31 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Wed, 12 Mar 2025 08:41:31 -0400 Subject: [TUHS] "Webster's Second on the Head of a Pin"? In-Reply-To: <2152f633-b9c0-43a7-953e-3a4260b98b77@app.fastmail.com> References: <2152f633-b9c0-43a7-953e-3a4260b98b77@app.fastmail.com> Message-ID: Thanks for the advice. I somehow had inferred from the man page that 512M was the default. In any event, it doesn't move the needle on compression ratio. Doug On Tue, Mar 11, 2025 at 10:47 AM Jeff Johnson wrote: > As far as getting maximum compression from lzip, `lzip -9 -s 512Mi` > is usually the preferred invocation. I think the default dictionary > size at -9 is 64MiB, but that's not that relevant I guess, unless > we are comparing different methods and trying to see about what > ratios different algorithms get at similar memory usage. > > -- > Jeffrey H. Johnson > trnsz at pobox.com > > On Mon, Mar 10, 2025, at 8:23 PM, Douglas McIlroy wrote: > > Gnu lzip -9 got a 3.92 compression factor , while Morris-Thompson got > 4.52. > > -- > Jeffrey H. Johnson > trnsz at pobox.com > > On Mon, Mar 10, 2025, at 8:23 PM, Douglas McIlroy wrote: > > Gnu lzip -9 got a 3.92 compression factor , while Morris-Thompson got > 4.52. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From arnold at skeeve.com Wed Mar 12 23:01:34 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Wed, 12 Mar 2025 07:01:34 -0600 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> <20250312114250.GF8837@mit.edu> Message-ID: <202503121301.52CD1YGQ424092@freefriends.org> Rob Pike wrote: > Well, it's creation time right after the file is created. I mean that's > kind of a joke, but also possibly an explanation for the comment, if you > consider the mindset of the person writing the code at the time. Then they > worked on the code some more and things moved along. > > In short, it could well be an honest, original mistake. This is what it feels like to me. FWIW, the "creation time" comment made it's way into the Unix programming appendix in K&R-1. It was there in the early printings of K&R-2 until I pointed it out to Brian... :-) I don't doubt that it being in K&R was also a source of misunderstanding for many people. Arnold From tuhs at tuhs.org Wed Mar 12 23:32:20 2025 From: tuhs at tuhs.org (Arrigo Triulzi via TUHS) Date: Wed, 12 Mar 2025 14:32:20 +0100 Subject: [TUHS] Sad news: we lost George Coulouris Message-ID: <8C70A575-6B02-4F88-9BC5-A947617BC6A2@alchemistowl.org> Found out today that we lost George Coulouris about a month ago, he was at QMC (then QMW, now Queen Mary, University of London) in CompSci and an old Unix hand (but not only). Obituary from his PhD student (who wrote a Unix editor called “ded”): https://www.theguardian.com/education/2025/jan/19/george-coulouris-obituary -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Thu Mar 13 01:46:38 2025 From: crossd at gmail.com (Dan Cross) Date: Wed, 12 Mar 2025 11:46:38 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: On Wed, Mar 12, 2025 at 1:57 AM Theodore Ts'o wrote: > As part of a discusion on the Linux kernel mailing list, there was an > assertion that ctime was orginally "creation time". > > From the v7 sources in TUHS, we can see: > > struct dinode > { > unsigned short di_mode; /* mode and type of file */ > short di_nlink; /* number of links to file */ > short di_uid; /* owner's user id */ > short di_gid; /* owner's group id */ > off_t di_size; /* number of bytes in file */ > char di_addr[40]; /* disk block addresses */ > time_t di_atime; /* time last accessed */ > time_t di_mtime; /* time last modified */ > time_t di_ctime; /* time created */ > }; > > ... although the v7 kernel sources does seem to update ctime when the > inode metadata changes, regardless of what the coment in > /usr/src/sys/h/ino.h might say. > > More interestingly, this comment seems to continue in newer versions > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > probably coincident with the File System Implementation? > > The best we can guess is that the change from "creation time" to > "inode change time" happened sometime between 1979 and 1982. Does > anyone who was around can give the story about how and when this > happened? I chased this through the various history archives once, and my conclusion was that the original comment was always misleading. As you say, in 7th Edition it's updated in `iupdat` whenever `ICHG` is set in the inode's `i_flag`; so basically whenever the inode itself changes, or when the file is written (ICHG is set in `writei`). If the intent was to capture the creation time, I don't know why `i_ctime` wouldn't be set in `maknode` and its value otherwise unchanged in `iupdat`. My guess is that the comment was the result of initial exploration of an idea that evolved very quickly, and never reflected reality (at least not for very long). Further, I believe BSD simply changed the documentation to reflect reality. It makes sense that they would do this around the time the FFS was implemented (I see it updated in 4.1c.1, at least), when presumably they would have needed to be deep into understanding the existing filesystem's semantics in order to faithfully reproduce them in the new implementation. I imagine someone sitting at a terminal connected to some VAX asking much the same questions we are now. Funny how history repeats itself. - Dan C. From tuhs at tuhs.org Thu Mar 13 02:28:16 2025 From: tuhs at tuhs.org (Bakul Shah via TUHS) Date: Wed, 12 Mar 2025 09:28:16 -0700 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: <53DFD2F7-3819-484A-85B1-42716C7DB136@iitbombay.org> On Mar 12, 2025, at 8:46 AM, Dan Cross wrote: > > My guess is that the comment was the result of initial > exploration of an idea that evolved very quickly, and never reflected > reality (at least not for very long). The comment "time created" is not wrong from a deep philosophical point of view: Every time the file changes it is born again ;-) From rich.salz at gmail.com Thu Mar 13 02:30:48 2025 From: rich.salz at gmail.com (Rich Salz) Date: Wed, 12 Mar 2025 12:30:48 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312120719.GG8837@mit.edu> References: <20250312021932.GA367338@mit.edu> <20250312024902.GH24601@mcvoy.com> <20250312120719.GG8837@mit.edu> Message-ID: On Wed, Mar 12, 2025 at 8:07 AM Theodore Ts'o wrote: > I have fond memories of looking at the Larn (or was it Hack/Nethack?) > sources and seeing that it checked ctime to prevent people from > cheating by saving and restoring the game save file, since ctime > couldn't be set without either changing the system clock or directly > accessing the file system (or kernel memory) and messing with the > on-disk inode directly. Rogue did it first. :) They also store the inode number. -------------- next part -------------- An HTML attachment was scrubbed... URL: From johnl at taugh.com Thu Mar 13 02:35:27 2025 From: johnl at taugh.com (John R Levine) Date: 12 Mar 2025 12:35:27 -0400 Subject: [TUHS] parsing tools, was What would early alternatives In-Reply-To: References: Message-ID: <8a0a29f1-9313-26f2-8cf2-7d32ef207108@taugh.com> On Tue, 11 Mar 2025, Rich Salz wrote: >> Lex was a dog. Flex, which as far as I know was a complete rewrite >> that shared no code with lex, are not bad. > There was a Usenix presentation -- late 80's (Washington?) but I couldn't > find it. -- by Vern Paxson about his rewrite called flex. I think he said > that the main reason lex was slow was the clever folding it had to do of > its internal tables to fit. Maybe, maybe not just great programming. It was mostly written by a summer student named Eric Schmidt. I wonder what happened to him. Regards, John Levine, johnl at taugh.com, Taughannock Networks, Trumansburg NY Please consider the environment before reading this e-mail. https://jl.ly From mrochkind at gmail.com Thu Mar 13 03:58:09 2025 From: mrochkind at gmail.com (Marc Rochkind) Date: Wed, 12 Mar 2025 11:58:09 -0600 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312021932.GA367338@mit.edu> References: <20250312021932.GA367338@mit.edu> Message-ID: According to my 1985 book *Advanced UNIX Programming*, the st_ctime member is "the status-change time, updated when the file is written or when the mode, owner, group, link count, or modification time is changed. Updated when the access time is changed explicitly via utime, but not when the file is read." I don't ever remember st_ctime being the creation time. I probably got that information from whatever UNIX manuals I was reading at the time. Marc On Tue, Mar 11, 2025 at 8:57 PM Theodore Ts'o wrote: > As part of a discusion on the Linux kernel mailing list, there was an > assertion that ctime was orginally "creation time". > > From the v7 sources in TUHS, we can see: > > struct dinode > { > unsigned short di_mode; /* mode and type of file */ > short di_nlink; /* number of links to file */ > short di_uid; /* owner's user id */ > short di_gid; /* owner's group id */ > off_t di_size; /* number of bytes in file */ > char di_addr[40]; /* disk block addresses */ > time_t di_atime; /* time last accessed */ > time_t di_mtime; /* time last modified */ > time_t di_ctime; /* time created */ > }; > > ... although the v7 kernel sources does seem to update ctime when the > inode metadata changes, regardless of what the coment in > /usr/src/sys/h/ino.h might say. > > More interestingly, this comment seems to continue in newer versions > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > probably coincident with the File System Implementation? > > The best we can guess is that the change from "creation time" to > "inode change time" happened sometime between 1979 and 1982. Does > anyone who was around can give the story about how and when this > happened? > > - Ted > -- Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. -------------- next part -------------- An HTML attachment was scrubbed... URL: From hjt at xs4all.nl Thu Mar 13 04:18:06 2025 From: hjt at xs4all.nl (H.J.Thomassen) Date: Wed, 12 Mar 2025 19:18:06 +0100 Subject: [TUHS] Sad news: we lost George Colouris Message-ID: <84a8001a-269c-2f77-0b9c-db55bfb3de3b@xs4all.nl> In July 1974 I visited Bell Labs Murray Hill, and Ken & Dennis showed me around. I was very impressed because we (Univ of Nijmegen, NL) had a PDP-11/45 just like theirs and I knew that machine quite well. It was clear that their software kicked the machine much heavier than our (DEC-original) DOS-system did. But I was a naive student so I wanted more information and asked Ken: are there many UNIX users in Europe? Ken brought us to the library where a Ms. Irma B. Biren, librarian, kept the record of licenses. We found prof. Colouris in London... When I asked whether maybe somebody closer by our place was present, Ken found Gideon Yuval in Tel Aviv. Nobody closer.... Hendrik-Jan Thomassen From dave at horsfall.org Thu Mar 13 05:55:11 2025 From: dave at horsfall.org (Dave Horsfall) Date: Thu, 13 Mar 2025 06:55:11 +1100 (AEDT) Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: <20250312120719.GG8837@mit.edu> References: <20250312021932.GA367338@mit.edu> <20250312024902.GH24601@mcvoy.com> <20250312120719.GG8837@mit.edu> Message-ID: <5bc734de-91a2-2326-08e7-7c39da170927@horsfall.org> On Wed, 12 Mar 2025, Theodore Ts'o wrote: > I have fond memories of looking at the Larn (or was it Hack/Nethack?) > sources and seeing that it checked ctime to prevent people from > cheating by saving and restoring the game save file, since ctime > couldn't be set without either changing the system clock or directly > accessing the file system (or kernel memory) and messing with the > on-disk inode directly. I had great fun with Rogue when symlinks were invented :-) -- Dave From kennethgoodwin56 at gmail.com Thu Mar 13 06:50:23 2025 From: kennethgoodwin56 at gmail.com (Kenneth Goodwin) Date: Wed, 12 Mar 2025 16:50:23 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: I would take that IF clause to mean If ICHG SET then exhibit new behavior and create now means change for ctime field otherwise follow original logic of ctime as being original epoch creation date. Ie both behaviors are supported and depend on a flag bit being set or cleared On Tue, Mar 11, 2025, 11:57 PM Warner Losh wrote: > So I think I have to disagree about what it means in v7, comments > notwithstanding. > > iupdat() has: > if(ip->i_flag&ICHG) > dp->di_ctime = time; > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few others, > this was a quick peek). > > So even in v7, ctime means any changes to metadata / inode data. At least > for some definition of "any". > > Btw, it looks like this was named IUPD in v6 and did similar things. In v6 > it is updated in iput when IUPD is set. But it looks like ctime and mtime > are folded together. So v7 is the first edition that has ctime and the code > implements any change to metadata from my reading. > > Warner > > On Tue, Mar 11, 2025 at 8:19 PM Theodore Ts'o wrote: > >> As part of a discusion on the Linux kernel mailing list, there was an >> assertion that ctime was orginally "creation time". >> >> From the v7 sources in TUHS, we can see: >> >> struct dinode >> { >> unsigned short di_mode; /* mode and type of file */ >> short di_nlink; /* number of links to file */ >> short di_uid; /* owner's user id */ >> short di_gid; /* owner's group id */ >> off_t di_size; /* number of bytes in file */ >> char di_addr[40]; /* disk block addresses */ >> time_t di_atime; /* time last accessed */ >> time_t di_mtime; /* time last modified */ >> time_t di_ctime; /* time created */ >> }; >> >> ... although the v7 kernel sources does seem to update ctime when the >> inode metadata changes, regardless of what the coment in >> /usr/src/sys/h/ino.h might say. >> >> More interestingly, this comment seems to continue in newer versions >> up to 3BSD, and then the comments becomes "change time" in BSD 4.2, >> probably coincident with the File System Implementation? >> >> The best we can guess is that the change from "creation time" to >> "inode change time" happened sometime between 1979 and 1982. Does >> anyone who was around can give the story about how and when this >> happened? >> >> - Ted >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From crossd at gmail.com Thu Mar 13 07:32:20 2025 From: crossd at gmail.com (Dan Cross) Date: Wed, 12 Mar 2025 17:32:20 -0400 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: On Tue, Mar 11, 2025 at 11:47 PM Warner Losh wrote: > So I think I have to disagree about what it means in v7, comments notwithstanding. I'm not sure which part you're disagreeing with; I think Ted had it right. The comment is just wrong, but that's what he said, if I understood correctly. > iupdat() has: > if(ip->i_flag&ICHG) > dp->di_ctime = time; > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few others, this was a quick peek). > > So even in v7, ctime means any changes to metadata / inode data. At least for some definition of "any". > > Btw, it looks like this was named IUPD in v6 and did similar things. In v6 it is updated in iput when IUPD is set. But it looks like ctime and mtime are folded together. So v7 is the first edition that has ctime and the code implements any change to metadata from my reading. In both 6th and 7th edition (and later? Certainly into early BSD), IUPD governs updates to mtime. 6th Edition doesn't have ctime at all, and uses mtime for everything that the combination of mtime and ctime is used for in later systems (modulo things only added later, of course). I'd rephrase what you wrote as saying that what mtime represented in the 6th edition is split between ctime and mtime in 7th edition and later; IUPD is retained but is specific to updating the file contents, while ICHG represents any change to the file, whether contents or metadata. - Dan C. > On Tue, Mar 11, 2025 at 8:19 PM Theodore Ts'o wrote: >> >> As part of a discusion on the Linux kernel mailing list, there was an >> assertion that ctime was orginally "creation time". >> >> From the v7 sources in TUHS, we can see: >> >> struct dinode >> { >> unsigned short di_mode; /* mode and type of file */ >> short di_nlink; /* number of links to file */ >> short di_uid; /* owner's user id */ >> short di_gid; /* owner's group id */ >> off_t di_size; /* number of bytes in file */ >> char di_addr[40]; /* disk block addresses */ >> time_t di_atime; /* time last accessed */ >> time_t di_mtime; /* time last modified */ >> time_t di_ctime; /* time created */ >> }; >> >> ... although the v7 kernel sources does seem to update ctime when the >> inode metadata changes, regardless of what the coment in >> /usr/src/sys/h/ino.h might say. >> >> More interestingly, this comment seems to continue in newer versions >> up to 3BSD, and then the comments becomes "change time" in BSD 4.2, >> probably coincident with the File System Implementation? >> >> The best we can guess is that the change from "creation time" to >> "inode change time" happened sometime between 1979 and 1982. Does >> anyone who was around can give the story about how and when this >> happened? >> >> - Ted From tuhs at tuhs.org Thu Mar 13 09:08:16 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Wed, 12 Mar 2025 23:08:16 +0000 Subject: [TUHS] When/how did ctime change from inode "creation" to "change" time? In-Reply-To: References: <20250312021932.GA367338@mit.edu> Message-ID: On Wednesday, March 12th, 2025 at 2:32 PM, Dan Cross wrote: > On Tue, Mar 11, 2025 at 11:47 PM Warner Losh imp at bsdimp.com wrote: > > > So I think I have to disagree about what it means in v7, comments notwithstanding. > > > I'm not sure which part you're disagreeing with; I think Ted had it > right. The comment is just wrong, but that's what he said, if I > understood correctly. > > > iupdat() has: > > if(ip->i_flag&ICHG) > > dp->di_ctime = time; > > > > ICHG is set in unlink(), chmod(), chown() and utime() (maybe a few others, this was a quick peek). > > > > So even in v7, ctime means any changes to metadata / inode data. At least for some definition of "any". > > > > Btw, it looks like this was named IUPD in v6 and did similar things. In v6 it is updated in iput when IUPD is set. But it looks like ctime and mtime are folded together. So v7 is the first edition that has ctime and the code implements any change to metadata from my reading. > > > In both 6th and 7th edition (and later? Certainly into early BSD), > IUPD governs updates to mtime. 6th Edition doesn't have ctime at all, > and uses mtime for everything that the combination of mtime and ctime > is used for in later systems (modulo things only added later, of > course). I'd rephrase what you wrote as saying that what mtime > represented in the 6th edition is split between ctime and mtime in 7th > edition and later; IUPD is retained but is specific to updating the > file contents, while ICHG represents any change to the file, whether > contents or metadata. > > - Dan C. > > > On Tue, Mar 11, 2025 at 8:19 PM Theodore Ts'o tytso at mit.edu wrote: > > > > > As part of a discusion on the Linux kernel mailing list, there was an > > > assertion that ctime was orginally "creation time". > > > > > > From the v7 sources in TUHS, we can see: > > > > > > struct dinode > > > { > > > unsigned short di_mode; /* mode and type of file / > > > short di_nlink; / number of links to file / > > > short di_uid; / owner's user id / > > > short di_gid; / owner's group id / > > > off_t di_size; / number of bytes in file / > > > char di_addr[40]; / disk block addresses / > > > time_t di_atime; / time last accessed / > > > time_t di_mtime; / time last modified / > > > time_t di_ctime; / time created */ > > > }; > > > > > > ... although the v7 kernel sources does seem to update ctime when the > > > inode metadata changes, regardless of what the coment in > > > /usr/src/sys/h/ino.h might say. > > > > > > More interestingly, this comment seems to continue in newer versions > > > up to 3BSD, and then the comments becomes "change time" in BSD 4.2, > > > probably coincident with the File System Implementation? > > > > > > The best we can guess is that the change from "creation time" to > > > "inode change time" happened sometime between 1979 and 1982. Does > > > anyone who was around can give the story about how and when this > > > happened? > > > > > > - Ted For the record the dinode structure still exists in a header as of "SVR5" aka UnixWare, although it is renamed "xxdinode" and lives with several other "xx" things, so presumably just old stuff lying around. The comment is still the same, "time created". There is also a larger inode structure also in an "xx" file which instead lists "last inode change time". Finally it looks like SCO may have paved over all of this with their own headers, you've got an "sco_st_stat32" struct with st_ctime and a comment of "time of last file status change". The current illumos sources contain an "icommon" structure in which the ic_ctime member is commented "last time inode changed". All that to say it looks like the commentary eventually gets cleaned up in the SysV lineage but the old commentary persists up to SVR4 where it can be found in /usr/src/uts//sys/fs/s5ino.h Needless to say, in iupdat() this appears to consistently be set based on ICHG across these versions so the intention was probably the same all along. - Matt G. From jsg at jsg.id.au Sun Mar 16 16:26:19 2025 From: jsg at jsg.id.au (Jonathan Gray) Date: Sun, 16 Mar 2025 17:26:19 +1100 Subject: [TUHS] A Unix Code Walkthrough by Ken Thompson In-Reply-To: References: Message-ID: On Mon, Jan 13, 2025 at 05:23:11PM +1100, Jonathan Gray wrote: > On Mon, Jan 13, 2025 at 03:11:21PM +1000, Warren Toomey via TUHS wrote: > > On Sun, Dec 29, 2024 at 07:40:15AM +1000, Warren Toomey via TUHS wrote: > > > Hi all, I've just received a set of MP3 recordings from Bob Kridle. He says: > > > > > > These are recordings of Ken Thompson doing a read through of one of > > > an early UNIX kernel code listing with a group of grad students at > > > UC Berkeley while he was a visiting prof. there. > > > > > > The date is roughly 1975. I've put the recordings here along with his > > > e-mails about the recordings: > > > > > > https://www.tuhs.org/Archive/Recordings/1975_Unix_Code_Walkthru/ > > > > Eric Allman has kindly supplied a scan of the source code that ken > > provided for the walkthrough. I've added it to the above location. > > It's interesting in that there are comments at the start of each > > function - the existing 5th Edition code that we have does not > > have these comments. > > With alloc.c the comments are there in v6 and the copyright line > removal matches the changes between v5 and v6. > > The code appears to be somewhere between v6 and the "50 changes" tape > described by Mike O'Brien in: > Applications/Spencer_Tapes/unsw3.tar.gz > usr/sys/v6unix/changenotes I've retyped the source code to make it easier to compare: https://github.com/jonathangray/v6-allman Some of the changes are in other v6-derived trees. Some such as the bootstrap and parity changes I couldn't find elsewhere. From tuhs at tuhs.org Mon Mar 17 02:40:15 2025 From: tuhs at tuhs.org (Jan Schaumann via TUHS) Date: Sun, 16 Mar 2025 12:40:15 -0400 Subject: [TUHS] a bit of Plan9 history Message-ID: Hi, This was seen on Dave Farber's IP list; with many of the Dramatis Personae on this list it seems worthwhile to share it here, too: "What I Saw at the Evolution of Plan 9" by Geoff Collyer https://adi.onl/oral.pdf -Jan From fariborz.t at gmail.com Mon Mar 17 03:56:58 2025 From: fariborz.t at gmail.com (Skip Tavakkolian) Date: Sun, 16 Mar 2025 10:56:58 -0700 Subject: [TUHS] a bit of Plan9 history In-Reply-To: References: Message-ID: For those who would like to know more, this is a featured talk at the upcoming 11th IWP9 in Paris in May. On Sun, Mar 16, 2025, 9:47 AM Jan Schaumann via TUHS wrote: > Hi, > > This was seen on Dave Farber's IP list; with many of > the Dramatis Personae on this list it seems worthwhile > to share it here, too: > > "What I Saw at the Evolution of Plan 9" by Geoff Collyer > > https://adi.onl/oral.pdf > > -Jan > -------------- next part -------------- An HTML attachment was scrubbed... URL: From babydr at baby-dragons.com Sat Mar 15 08:19:59 2025 From: babydr at baby-dragons.com (babydr DBA James W. Laferriere) Date: Fri, 14 Mar 2025 14:19:59 -0800 (AKDT) Subject: [TUHS] Question about compiled programs on a newer machine type be usable ? Message-ID: <62379a5a-a2d7-c06c-34cc-3b996e067281@baby-dragons.com> Hello all , Here I am again , Maybe my ? might be relative to the community . I grabbed an archive of precompiled gnu software , During the extraction I got a 'warning' from a gzip'd tar & decided to dig into the underlying tarball to see if it was trully corrupted or repairable . Tada , I did a silly and less'd one of the binary files and noticed that the underlying file was 'ar'd by /usr/local/alphaev6-dec-osf5.1b/bin/ar , well all said and done My ol' as100 ain't a ev6 . So my Question , Could I use these programs that were created on a ev6 cpu system on my ev4 ? Tia , JimL -- +---------------------------------------------------------------------+ | James W. Laferriere | System Techniques | Give me VMS | | Network & System Engineer | 3237 Holden Road | Give me Linux | | jiml at system-techniques.com | Fairbanks, AK. 99709 | only on AXP | +---------------------------------------------------------------------+ From henry.r.bent at gmail.com Mon Mar 17 09:08:01 2025 From: henry.r.bent at gmail.com (Henry Bent) Date: Sun, 16 Mar 2025 19:08:01 -0400 Subject: [TUHS] Question about compiled programs on a newer machine type be usable ? In-Reply-To: <62379a5a-a2d7-c06c-34cc-3b996e067281@baby-dragons.com> References: <62379a5a-a2d7-c06c-34cc-3b996e067281@baby-dragons.com> Message-ID: On Sun, 16 Mar 2025 at 18:47, babydr DBA James W. Laferriere < babydr at baby-dragons.com> wrote: > Hello all , Here I am again , Maybe my ? might be relative to > the > community . > > I grabbed an archive of precompiled gnu software , During the > extraction I got a 'warning' from a gzip'd tar & decided to dig into the > underlying tarball to see if it was trully corrupted or repairable . > > Tada , I did a silly and less'd one of the binary files and > noticed > that the underlying file was 'ar'd by > /usr/local/alphaev6-dec-osf5.1b/bin/ar , > well all said and done My ol' as100 ain't a ev6 . > > So my Question , Could I use these programs that were created on a ev6 > cpu > system on my ev4 ? Under Tru64 any program compiled for any Alpha CPU will run on any other, but there will be emulation overhead when executing instructions that your CPU doesn't have. On a much slower machine this could be pretty painful. Not sure what you mean by an AS100 - do you have an AS1000? Or an AS200 4/100? -Henry -------------- next part -------------- An HTML attachment was scrubbed... URL: From babydr at baby-dragons.com Mon Mar 17 09:24:58 2025 From: babydr at baby-dragons.com (babydr DBA James W. Laferriere) Date: Sun, 16 Mar 2025 15:24:58 -0800 (AKDT) Subject: [TUHS] Question about compiled programs on a newer machine type be usable ? In-Reply-To: References: <62379a5a-a2d7-c06c-34cc-3b996e067281@baby-dragons.com> Message-ID: Hello Henry , On Sun, 16 Mar 2025, Henry Bent wrote: > On Sun, 16 Mar 2025 at 18:47, babydr DBA James W. Laferriere < > babydr at baby-dragons.com> wrote: > >> Hello all , Here I am again , Maybe my ? might be relative to >> the >> community . >> >> I grabbed an archive of precompiled gnu software , During the >> extraction I got a 'warning' from a gzip'd tar & decided to dig into the >> underlying tarball to see if it was trully corrupted or repairable . >> >> Tada , I did a silly and less'd one of the binary files and >> noticed >> that the underlying file was 'ar'd by >> /usr/local/alphaev6-dec-osf5.1b/bin/ar , >> well all said and done My ol' as100 ain't a ev6 . >> >> So my Question , Could I use these programs that were created on a ev6 >> cpu >> system on my ev4 ? > > Under Tru64 any program compiled for any Alpha CPU will run on any other, > but there will be emulation overhead when executing instructions that your > CPU doesn't have. On a much slower machine this could be pretty painful. > Not sure what you mean by an AS100 - do you have an AS1000? Or an AS200 > 4/100? > > -Henry Yes , AS200 4/100 , Painful I don't need , But it's better than attempting to find the right combinations of shareable libraries & compiler flags are required to cleanly compile most of the gnu stuff . And yes I have been trying mostly unsuccessfully to compile some of the tools/utilities I have grown accustomed to on the linux platforms . Tho I am not a fan of rpm . So ... Off I go again . Tia , JimL -- +---------------------------------------------------------------------+ | James W. Laferriere | System Techniques | Give me VMS | | Network & System Engineer | 3237 Holden Road | Give me Linux | | jiml at system-techniques.com | Fairbanks, AK. 99709 | only on AXP | +---------------------------------------------------------------------+ From arnold at skeeve.com Tue Mar 18 00:50:36 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Mon, 17 Mar 2025 08:50:36 -0600 Subject: [TUHS] MPM macros and code from 2002 Message-ID: <202503171450.52HEoagO448150@freefriends.org> Hi All. A while back I found a copy of the MPM macros and code that I undoubtedly got from Brian Kernighan. Warren has put it in the archives: > Date: Mon, 17 Mar 2025 08:04:55 +1000 > From: Warren Toomey > To: Aharon Robbins > Cc: Warren Toomey > Subject: Re: MPM macros and code > > On Fri, Mar 07, 2025 at 03:11:47PM +0200, Aharon Robbins wrote: > > I have the following file: > > -rw-r--r-- 1 arnold arnold 100770 Feb 17 2002 mpm.shar > > Thanks Arnold, I've just put it here: > > https://www.tuhs.org/Archive/Applications/Typesetting/ > > Cheers! Warren Thanks Warren! Arnold From arnold at skeeve.com Tue Mar 18 00:52:16 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Mon, 17 Mar 2025 08:52:16 -0600 Subject: [TUHS] Caldera release of awk, grep, and libregex from 2001 Message-ID: <202503171452.52HEqGmQ448237@freefriends.org> Hi All. A while back I found the Caldera release of awk, grep and libregex from 2001. The tar file is dated 2012, but the actual code is from earlier. Warren has put it in the archive: > Date: Mon, 17 Mar 2025 08:07:12 +1000 > From: Warren Toomey > To: Aharon Robbins > Cc: Warren Toomey > Subject: Re: Caldera release of awk, grep, and libregex from 2001 > > On Mon, Feb 24, 2025 at 06:42:50PM +0200, Aharon Robbins wrote: > > I'm looking through my Downloads directory to try to clean it up > > a bit. I found this: > > > > $ ls -l osutils.tar.gz > > -rw-rw-r-- 1 arnold arnold 101072 Nov 25 2012 osutils.tar.gz > > Thanks for this as well, it's now at: > https://www.tuhs.org/Archive/Applications/Awk_Grep/ > > Cheers, Warren Thanks Warren! Arnold From jsg at jsg.id.au Tue Mar 18 08:18:04 2025 From: jsg at jsg.id.au (Jonathan Gray) Date: Tue, 18 Mar 2025 09:18:04 +1100 Subject: [TUHS] MPM macros and code from 2002 In-Reply-To: <202503171450.52HEoagO448150@freefriends.org> References: <202503171450.52HEoagO448150@freefriends.org> Message-ID: On Mon, Mar 17, 2025 at 08:50:36AM -0600, arnold at skeeve.com wrote: > Hi All. > > A while back I found a copy of the MPM macros and code > that I undoubtedly got from Brian Kernighan. >From the headers in mpm.shar, this is what Rob Pike posted to the 9fans list in 2002. https://marc.info/?l=9fans&m=111558829312191&w=2 > > Warren has put it in the archives: > > > Date: Mon, 17 Mar 2025 08:04:55 +1000 > > From: Warren Toomey > > To: Aharon Robbins > > Cc: Warren Toomey > > Subject: Re: MPM macros and code > > > > On Fri, Mar 07, 2025 at 03:11:47PM +0200, Aharon Robbins wrote: > > > I have the following file: > > > -rw-r--r-- 1 arnold arnold 100770 Feb 17 2002 mpm.shar > > > > Thanks Arnold, I've just put it here: > > > > https://www.tuhs.org/Archive/Applications/Typesetting/ > > > > Cheers! Warren > > Thanks Warren! > > Arnold > From ality at pbrane.org Tue Mar 18 13:44:10 2025 From: ality at pbrane.org (Anthony Martin) Date: Mon, 17 Mar 2025 20:44:10 -0700 Subject: [TUHS] MPM macros and code from 2002 In-Reply-To: References: <202503171450.52HEoagO448150@freefriends.org> Message-ID: Jonathan Gray once said: > On Mon, Mar 17, 2025 at 08:50:36AM -0600, arnold at skeeve.com wrote: > > A while back I found a copy of the MPM macros and code > > that I undoubtedly got from Brian Kernighan. > > From the headers in mpm.shar, this is what Rob Pike posted to the 9fans > list in 2002. > > https://marc.info/?l=9fans&m=111558829312191&w=2 This is taken from the second edition of Plan 9: https://github.com/plan9foundation/plan9/tree/2e-1995-04-05/sys/src/cmd/pm https://github.com/plan9foundation/plan9/blob/2e-1995-04-05/sys/man/6/mpm Cheers, Anthony From arnold at skeeve.com Tue Mar 18 14:56:18 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Mon, 17 Mar 2025 22:56:18 -0600 Subject: [TUHS] MPM macros and code from 2002 In-Reply-To: References: <202503171450.52HEoagO448150@freefriends.org> Message-ID: <202503180456.52I4uIDK500599@freefriends.org> Anthony Martin wrote: > Jonathan Gray once said: > > On Mon, Mar 17, 2025 at 08:50:36AM -0600, arnold at skeeve.com wrote: > > > A while back I found a copy of the MPM macros and code > > > that I undoubtedly got from Brian Kernighan. > > > > From the headers in mpm.shar, this is what Rob Pike posted to the 9fans > > list in 2002. > > > > https://marc.info/?l=9fans&m=111558829312191&w=2 > > This is taken from the second edition of Plan 9: > > https://github.com/plan9foundation/plan9/tree/2e-1995-04-05/sys/src/cmd/pm > https://github.com/plan9foundation/plan9/blob/2e-1995-04-05/sys/man/6/mpm > > Cheers, > Anthony Yes, my bad. I have gotten many things like this from BWK, so I assumed. In any case, enjoy. Arnold From jsg at jsg.id.au Tue Mar 18 19:39:50 2025 From: jsg at jsg.id.au (Jonathan Gray) Date: Tue, 18 Mar 2025 20:39:50 +1100 Subject: [TUHS] A Unix Code Walkthrough by Ken Thompson In-Reply-To: References: Message-ID: On Sun, Mar 16, 2025 at 05:26:19PM +1100, Jonathan Gray wrote: > On Mon, Jan 13, 2025 at 05:23:11PM +1100, Jonathan Gray wrote: > > On Mon, Jan 13, 2025 at 03:11:21PM +1000, Warren Toomey via TUHS wrote: > > > On Sun, Dec 29, 2024 at 07:40:15AM +1000, Warren Toomey via TUHS wrote: > > > > Hi all, I've just received a set of MP3 recordings from Bob Kridle. He says: > > > > > > > > These are recordings of Ken Thompson doing a read through of one of > > > > an early UNIX kernel code listing with a group of grad students at > > > > UC Berkeley while he was a visiting prof. there. > > > > > > > > The date is roughly 1975. I've put the recordings here along with his > > > > e-mails about the recordings: > > > > > > > > https://www.tuhs.org/Archive/Recordings/1975_Unix_Code_Walkthru/ > > > > > > Eric Allman has kindly supplied a scan of the source code that ken > > > provided for the walkthrough. I've added it to the above location. > > > It's interesting in that there are comments at the start of each > > > function - the existing 5th Edition code that we have does not > > > have these comments. > > > > With alloc.c the comments are there in v6 and the copyright line > > removal matches the changes between v5 and v6. > > > > The code appears to be somewhere between v6 and the "50 changes" tape > > described by Mike O'Brien in: > > Applications/Spencer_Tapes/unsw3.tar.gz > > usr/sys/v6unix/changenotes > > I've retyped the source code to make it easier to compare: > https://github.com/jonathangray/v6-allman > > Some of the changes are in other v6-derived trees. Some such as the > bootstrap and parity changes I couldn't find elsewhere. A scheduler change from the Naval Postgraduate School, described by Ken as "this is a fix from Monterey, and I think it's wrong". Tape 1 (Cleaner) ~ 43:17 Mentioned in the November 1976 UNIX News https://archive.org/details/unix_news_november-1976/page/n1/mode/2up tuhs Documentation/Usenix/Early_Newsletters/197611-unix-news-n11.pdf "Also available by separate request only is a new scheduler, which scans the process table twice; the first pass throws out a process only if that process alone will make room for the new process. NPS claims 25% improvement on a 48K machine, Ken Thompson thinks otherwise. You will have to judge for yourself." And in February 1976, though the text in the scan is hard to read https://archive.org/details/unix_news_feb-10-1976/page/n7/mode/2up tuhs Documentation/Usenix/Early_Newsletters/19760210-unix-news-n3.pdf From rminnich at gmail.com Wed Mar 19 01:59:33 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 08:59:33 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? Message-ID: I know it's out there, I just can't find it. It's also called the "imake rhymes with mistake" paper, a nice companion to "going for baroque..." -------------- next part -------------- An HTML attachment was scrubbed... URL: From steffen at sdaoden.eu Wed Mar 19 02:03:58 2025 From: steffen at sdaoden.eu (Steffen Nurpmeso) Date: Tue, 18 Mar 2025 17:03:58 +0100 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> ron minnich wrote in : |I know it's out there, I just can't find it. Name: xorg-imake Path: /usr/ports/xorg Version: 1.0.10 Release: 1 Description: imake build system URL: http://xorg.freedesktop.org Maintainer: CRUX Xorg Team, xorg-ports at crux dot nu Dependencies: xorg-cf-files,xorg-xorgproto ... name=xorg-imake version=1.0.10 release=1 source=(https://www.x.org/releases/individual/util/imake-$version.tar.xz) build() { cd imake-$version ./configure --prefix=/usr make make DESTDIR=$PKG install } |It's also called the "imake rhymes with mistake" paper, a nice companion |to "going for baroque..." i hate it. --steffen | |Der Kragenbaer, The moon bear, |der holt sich munter he cheerfully and one by one |einen nach dem anderen runter wa.ks himself off |(By Robert Gernhardt) From rminnich at gmail.com Wed Mar 19 02:08:23 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 09:08:23 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> Message-ID: Sorry, I'm looking for the paper, not imake itself :-) I stopped typing imake many decades ago, and feel better for that fact. On Tue, Mar 18, 2025 at 9:04 AM Steffen Nurpmeso wrote: > ron minnich wrote in > : > > |I know it's out there, I just can't find it. > > Name: xorg-imake > Path: /usr/ports/xorg > Version: 1.0.10 > Release: 1 > Description: imake build system > URL: http://xorg.freedesktop.org > Maintainer: CRUX Xorg Team, xorg-ports at crux dot nu > Dependencies: xorg-cf-files,xorg-xorgproto > > ... > > name=xorg-imake > version=1.0.10 > release=1 > source=(https://www.x.org/releases/individual/util/imake-$version.tar.xz > ) > > build() { > cd imake-$version > > ./configure --prefix=/usr > > make > make DESTDIR=$PKG install > } > > |It's also called the "imake rhymes with mistake" paper, a nice companion > |to "going for baroque..." > > i hate it. > > --steffen > | > |Der Kragenbaer, The moon bear, > |der holt sich munter he cheerfully and one by one > |einen nach dem anderen runter wa.ks himself off > |(By Robert Gernhardt) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From henry.r.bent at gmail.com Wed Mar 19 02:14:12 2025 From: henry.r.bent at gmail.com (Henry Bent) Date: Tue, 18 Mar 2025 12:14:12 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: On Tue, 18 Mar 2025 at 12:09, ron minnich wrote: > I know it's out there, I just can't find it. > > It's also called the "imake rhymes with mistake" paper, a nice companion > to "going for baroque..." > I'm not finding anything with that phrase on the web or in Usenet except for NetBSD's pkgsrc, whose official description of imake is "obsolete build tool." Do you have a paper with a specific reference to that title? -Henry -------------- next part -------------- An HTML attachment was scrubbed... URL: From rminnich at gmail.com Wed Mar 19 02:20:43 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 09:20:43 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: I'm having a hard time finding the reference, and I wonder if it was maybe in ;login: "imake: An Obsolete Build Tool" by Paul W. Richards On Tue, Mar 18, 2025 at 9:14 AM Henry Bent wrote: > On Tue, 18 Mar 2025 at 12:09, ron minnich wrote: > >> I know it's out there, I just can't find it. >> >> It's also called the "imake rhymes with mistake" paper, a nice companion >> to "going for baroque..." >> > > I'm not finding anything with that phrase on the web or in Usenet except > for NetBSD's pkgsrc, whose official description of imake is "obsolete build > tool." Do you have a paper with a specific reference to that title? > > -Henry > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Wed Mar 19 02:21:41 2025 From: tuhs at tuhs.org (Alan Coopersmith via TUHS) Date: Tue, 18 Mar 2025 09:21:41 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> Message-ID: <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> On 3/18/25 09:03, Steffen Nurpmeso wrote: > ron minnich wrote in > : > > |I know it's out there, I just can't find it. > name=xorg-imake > version=1.0.10 > release=1 > source=(https://www.x.org/releases/individual/util/imake-$version.tar.xz) Yes, we still maintain Imake upstream at X.Org, but every new release for years has had notes like: The X Window System used imake extensively up through the X11R6.9 release, for both full builds within the source tree and external software. X moved off of imake for its build system in 2005 for X11R7.0 and later releases, but still provides imake for building existing external software programs that have not yet converted, though we are not actively maintaining it for new OS or platform releases. Anyone shipping software still using imake to build should be working on moving to something that is still adding support for new platforms and runtimes. as seen in https://lists.freedesktop.org/pipermail/xorg/2024-January/061524.html We don't even use Imake to build itself - since 2005 it has been built with autoconf & automake, and this year I merged the changes to let it build with meson instead: https://gitlab.freedesktop.org/xorg/util/imake/-/merge_requests/13 But I don't have a copy of the paper referred to in the first email in this thread, and both the sites we link to for Imake information from the Imake README seem to have died now: https://www.snake.net/software/imake-stuff/ https://www.kitebird.com/imake-book/ -- -Alan Coopersmith- alan.coopersmith at oracle.com Oracle Solaris Engineering - https://blogs.oracle.com/solaris From rminnich at gmail.com Wed Mar 19 02:24:35 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 09:24:35 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: https://archive.org/details/login_jan94/page/n31/mode/2up?q=imake "I said, in a review I wrote for this newsletter that Software Portability with imake [Paul DuBois, 1993. Software Portability with imake. O'Reilly & Associ¬ ates, Inc.] is a fine book: well-written, well-edited, and useful. Here, I'll argue that it's a fine book about a dreadful idea: imake. Actually, I've seen enough imake-like make front-ends to suspect that such things are as common as peas. I don't like any of them. Why?" On Tue, Mar 18, 2025 at 9:21 AM Alan Coopersmith < alan.coopersmith at oracle.com> wrote: > On 3/18/25 09:03, Steffen Nurpmeso wrote: > > ron minnich wrote in > > : > > > > |I know it's out there, I just can't find it. > > > name=xorg-imake > > version=1.0.10 > > release=1 > > source=( > https://www.x.org/releases/individual/util/imake-$version.tar.xz) > > Yes, we still maintain Imake upstream at X.Org, but every new release for > years > has had notes like: > > The X Window System used imake extensively up through the X11R6.9 > release, > for both full builds within the source tree and external software. > X moved off of imake for its build system in 2005 for X11R7.0 and later > releases, but still provides imake for building existing external > software > programs that have not yet converted, though we are not actively > maintaining > it for new OS or platform releases. > > Anyone shipping software still using imake to build should be working on > moving to something that is still adding support for new platforms and > runtimes. > > as seen in > https://lists.freedesktop.org/pipermail/xorg/2024-January/061524.html > > We don't even use Imake to build itself - since 2005 it has been built with > autoconf & automake, and this year I merged the changes to let it build > with > meson instead: > https://gitlab.freedesktop.org/xorg/util/imake/-/merge_requests/13 > > But I don't have a copy of the paper referred to in the first email in this > thread, and both the sites we link to for Imake information from the Imake > README seem to have died now: > > https://www.snake.net/software/imake-stuff/ > https://www.kitebird.com/imake-book/ > > -- > -Alan Coopersmith- alan.coopersmith at oracle.com > Oracle Solaris Engineering - https://blogs.oracle.com/solaris > -------------- next part -------------- An HTML attachment was scrubbed... URL: From steffen at sdaoden.eu Wed Mar 19 02:27:31 2025 From: steffen at sdaoden.eu (Steffen Nurpmeso) Date: Tue, 18 Mar 2025 17:27:31 +0100 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: <20250318162731.lH3qB0uB@steffen%sdaoden.eu> Alan Coopersmith via TUHS wrote in <95e2b2eb-4830-4fe1-9492-65bc53d3f297 at oracle.com>: |On 3/18/25 09:03, Steffen Nurpmeso wrote: |> ron minnich wrote in |> : |> |>|I know it's out there, I just can't find it. | |> name=xorg-imake |> version=1.0.10 |> release=1 |> source=(https://www.x.org/releases/individual/util/imake-$version.tar.\ |> xz) | |Yes, we still maintain Imake upstream at X.Org, but every new release \ |for years |has had notes like: ... |But I don't have a copy of the paper referred to in the first email in this |thread, and both the sites we link to for Imake information from the Imake |README seem to have died now: | | https://www.snake.net/software/imake-stuff/ | https://www.kitebird.com/imake-book/ yes, the timeouts i also had from Germany. --steffen | |Der Kragenbaer, The moon bear, |der holt sich munter he cheerfully and one by one |einen nach dem anderen runter wa.ks himself off |(By Robert Gernhardt) From ron at ronnatalie.com Wed Mar 19 02:39:45 2025 From: ron at ronnatalie.com (Ronald Natalie) Date: Tue, 18 Mar 2025 16:39:45 +0000 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: I make relies on having an independently callable C preprocessor that they make assumptions what the output from it look (which is not anything laid down by a standard or the like). We used to use Imake extensively to build our code base, and tehre were platforms I had to bring along my own CPP because the one for the compiler didn’t work for the purpose (mostly issues with space and tab substitutions). We eventually switched to CMAKE. -------------- next part -------------- An HTML attachment was scrubbed... URL: From aek at bitsavers.org Wed Mar 19 02:40:40 2025 From: aek at bitsavers.org (Al Kossow) Date: Tue, 18 Mar 2025 09:40:40 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <20250318162731.lH3qB0uB@steffen%sdaoden.eu> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> <20250318162731.lH3qB0uB@steffen%sdaoden.eu> Message-ID: > | https://www.kitebird.com/imake-book/ https://web.archive.org/web/20011201063421/http://www.kitebird.com/imake-book/#other-info but it's (obviously) mostly about the book From shadow at gmail.com Wed Mar 19 02:44:36 2025 From: shadow at gmail.com (Daria Phoebe Brashear) Date: Tue, 18 Mar 2025 12:44:36 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: Who's got old issues of login? Mine probably were lost when I moved out of my ex's place. J. S. Haemer. Imake rhymes with mistake. ;login:, 19(1):32-3, Jan-Feb 1994. On Tue, Mar 18, 2025 at 12:09 PM ron minnich wrote: > > I know it's out there, I just can't find it. > > It's also called the "imake rhymes with mistake" paper, a nice companion to "going for baroque..." -- Daria Phoebe Brashear AuriStor, Inc dariaphoebe.com From shadow at gmail.com Wed Mar 19 02:47:37 2025 From: shadow at gmail.com (Daria Phoebe Brashear) Date: Tue, 18 Mar 2025 12:47:37 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: Actually, here ya go: https://archive.org/details/login_jan94 On Tue, Mar 18, 2025 at 12:44 PM Daria Phoebe Brashear wrote: > > Who's got old issues of login? Mine probably were lost when I moved > out of my ex's place. > > J. S. Haemer. > Imake rhymes with mistake. > ;login:, 19(1):32-3, Jan-Feb 1994. > > On Tue, Mar 18, 2025 at 12:09 PM ron minnich wrote: > > > > I know it's out there, I just can't find it. > > > > It's also called the "imake rhymes with mistake" paper, a nice companion to "going for baroque..." > > > > -- > Daria Phoebe Brashear > AuriStor, Inc > dariaphoebe.com -- Daria Phoebe Brashear AuriStor, Inc dariaphoebe.com From rminnich at gmail.com Wed Mar 19 03:04:52 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 10:04:52 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: also, related, I did find the original, wonderful, paper about writing hello, world in x11: it was a compressed .ps; it uncompressed fine but almost nothing I have could work with the .ps. gs on osx, evince on linux, ... nothing liked it. Plan 9 ghostscript shows it just fine. so if you get that old paper and have trouble viewing it, there is a fix. On Tue, Mar 18, 2025 at 9:47 AM Daria Phoebe Brashear wrote: > Actually, here ya go: > https://archive.org/details/login_jan94 > > On Tue, Mar 18, 2025 at 12:44 PM Daria Phoebe Brashear > wrote: > > > > Who's got old issues of login? Mine probably were lost when I moved > > out of my ex's place. > > > > J. S. Haemer. > > Imake rhymes with mistake. > > ;login:, 19(1):32-3, Jan-Feb 1994. > > > > On Tue, Mar 18, 2025 at 12:09 PM ron minnich wrote: > > > > > > I know it's out there, I just can't find it. > > > > > > It's also called the "imake rhymes with mistake" paper, a nice > companion to "going for baroque..." > > > > > > > > -- > > Daria Phoebe Brashear > > AuriStor, Inc > > dariaphoebe.com > > > > -- > Daria Phoebe Brashear > AuriStor, Inc > dariaphoebe.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shadow at gmail.com Wed Mar 19 03:19:45 2025 From: shadow at gmail.com (Daria Phoebe Brashear) Date: Tue, 18 Mar 2025 13:19:45 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: If you mean https://www.nic.funet.fi/pub/unix/security/docs/usenix/usenix/winter88/x-helloworld.ps.gz I ran ps2pdf on it (via homebrew) and it opened right up. On Tue, Mar 18, 2025 at 1:05 PM ron minnich wrote: > > also, related, I did find the original, wonderful, paper about writing hello, world in x11: it was a compressed .ps; it uncompressed fine but almost nothing I have could work with the .ps. > > gs on osx, evince on linux, ... nothing liked it. > > Plan 9 ghostscript shows it just fine. > > so if you get that old paper and have trouble viewing it, there is a fix. > > > > On Tue, Mar 18, 2025 at 9:47 AM Daria Phoebe Brashear wrote: >> >> Actually, here ya go: >> https://archive.org/details/login_jan94 >> >> On Tue, Mar 18, 2025 at 12:44 PM Daria Phoebe Brashear wrote: >> > >> > Who's got old issues of login? Mine probably were lost when I moved >> > out of my ex's place. >> > >> > J. S. Haemer. >> > Imake rhymes with mistake. >> > ;login:, 19(1):32-3, Jan-Feb 1994. >> > >> > On Tue, Mar 18, 2025 at 12:09 PM ron minnich wrote: >> > > >> > > I know it's out there, I just can't find it. >> > > >> > > It's also called the "imake rhymes with mistake" paper, a nice companion to "going for baroque..." >> > >> > >> > >> > -- >> > Daria Phoebe Brashear >> > AuriStor, Inc >> > dariaphoebe.com >> >> >> >> -- >> Daria Phoebe Brashear >> AuriStor, Inc >> dariaphoebe.com -- Daria Phoebe Brashear AuriStor, Inc dariaphoebe.com From rminnich at gmail.com Wed Mar 19 03:23:25 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 10:23:25 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: you're right, that works fine, thanks. no idea why apple preview won't just open .ps files any more, I thought that used to work. On Tue, Mar 18, 2025 at 10:19 AM Daria Phoebe Brashear wrote: > If you mean > > https://www.nic.funet.fi/pub/unix/security/docs/usenix/usenix/winter88/x-helloworld.ps.gz > I ran ps2pdf on it (via homebrew) and it opened right up. > > > > On Tue, Mar 18, 2025 at 1:05 PM ron minnich wrote: > > > > also, related, I did find the original, wonderful, paper about writing > hello, world in x11: it was a compressed .ps; it uncompressed fine but > almost nothing I have could work with the .ps. > > > > gs on osx, evince on linux, ... nothing liked it. > > > > Plan 9 ghostscript shows it just fine. > > > > so if you get that old paper and have trouble viewing it, there is a fix. > > > > > > > > On Tue, Mar 18, 2025 at 9:47 AM Daria Phoebe Brashear > wrote: > >> > >> Actually, here ya go: > >> https://archive.org/details/login_jan94 > >> > >> On Tue, Mar 18, 2025 at 12:44 PM Daria Phoebe Brashear < > shadow at gmail.com> wrote: > >> > > >> > Who's got old issues of login? Mine probably were lost when I moved > >> > out of my ex's place. > >> > > >> > J. S. Haemer. > >> > Imake rhymes with mistake. > >> > ;login:, 19(1):32-3, Jan-Feb 1994. > >> > > >> > On Tue, Mar 18, 2025 at 12:09 PM ron minnich > wrote: > >> > > > >> > > I know it's out there, I just can't find it. > >> > > > >> > > It's also called the "imake rhymes with mistake" paper, a nice > companion to "going for baroque..." > >> > > >> > > >> > > >> > -- > >> > Daria Phoebe Brashear > >> > AuriStor, Inc > >> > dariaphoebe.com > >> > >> > >> > >> -- > >> Daria Phoebe Brashear > >> AuriStor, Inc > >> dariaphoebe.com > > > > -- > Daria Phoebe Brashear > AuriStor, Inc > dariaphoebe.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Wed Mar 19 03:44:06 2025 From: clemc at ccc.com (Clem Cole) Date: Tue, 18 Mar 2025 13:44:06 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: On Tue, Mar 18, 2025 at 1:23 PM ron minnich wrote: > you're right, that works fine, thanks. > Yep - so does Acrobat if you have it. > > no idea why apple preview won't just open .ps files any more, I thought > that used to work. > As Henry once so wisely observed: "*4.2BSD is just like UNIX, only different.*" DEC lived, and, I guess, eventually died, by "The Promise" - that old things kept working on newer releases. Be it VMS, Ultrix, Tru64 or whatever. Maybe not easily, maybe not fast, But they worked. If you were "inside" it, was very difficult to remove an old feature. It may also be that DEC and the like cared about the ISVs and the promise was for more that the end user. Modern systems (to be fair, even penguins) don't seem to care as much about that idea. The license to "improve" something or "deprecate" old behavior is based on a Greek hubris that was taught to many over the years. Apple and Microsoft often seem to have always had a few extra helpings. It seems silly to me, too. BTW: ps2pdf is using ghostscript under the covers, so you should have been able to call it directly. In "brew" pstools kits which ps2df is part, requires it and thus ensures it is installed. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Wed Mar 19 04:15:20 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Tue, 18 Mar 2025 14:15:20 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> On 3/18/25 1:04 PM, ron minnich wrote: > also, related, I did find the original, wonderful, paper about > writing hello, world in x11: it was a compressed .ps; it uncompressed fine > but almost nothing I have could work with the .ps. I have a pdf of that paper. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From woods at robohack.ca Wed Mar 19 05:03:48 2025 From: woods at robohack.ca (Greg A. Woods) Date: Tue, 18 Mar 2025 12:03:48 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: At Tue, 18 Mar 2025 16:39:45 +0000, "Ronald Natalie" wrote: Subject: [TUHS] Re: "Imake: an obsolete build tool" -- anybody got this? > > We eventually switched > to CMAKE. Yikes. That's not what I would call an improvement! I think CMake is about as anti-Unix (in philosophy) as you can get. At least Imake had a clear design and simple basic concept, and in its basic form it could be highly performant. It was simple tool for using other tools in an innovative way. As used by X11 though it was part of what might best be described as a dog's breakfast, but that's only really because the X11 macros kind of grew by committee and had an extraordinary variety of target systems and source code to contend with. CMake has ended up with what I would consider an even uglier and more unpalatable dog's breakfast. It also abuses other tools, requires hours to compile on otherwise usable platforms, and wastes CPU and memory like nothing else. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From g.branden.robinson at gmail.com Wed Mar 19 05:09:09 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Tue, 18 Mar 2025 14:09:09 -0500 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: <20250318190909.c6z4s4srr5hfrwmc@illithid> At 2025-03-18T12:03:48-0700, Greg A. Woods wrote: > At Tue, 18 Mar 2025 16:39:45 +0000, "Ronald Natalie" wrote: > Subject: [TUHS] Re: "Imake: an obsolete build tool" -- anybody got this? > > > > We eventually switched to CMAKE. > > Yikes. That's not what I would call an improvement! > > I think CMake is about as anti-Unix (in philosophy) as you can get. > > At least Imake had a clear design and simple basic concept, and in its > basic form it could be highly performant. It was simple tool for using > other tools in an innovative way. As used by X11 though it was part of > what might best be described as a dog's breakfast, but that's only > really because the X11 macros kind of grew by committee and had an > extraordinary variety of target systems and source code to contend with. > > CMake has ended up with what I would consider an even uglier and more > unpalatable dog's breakfast. It also abuses other tools, requires hours > to compile on otherwise usable platforms, and wastes CPU and memory like > nothing else. I believe the usual rejoinder here is "worse is better". My experiences with CMake are not happy ones. Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From tuhs at tuhs.org Wed Mar 19 05:21:36 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Tue, 18 Mar 2025 19:21:36 +0000 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> Message-ID: On Tuesday, March 18th, 2025 at 12:03 PM, Greg A. Woods wrote: > At Tue, 18 Mar 2025 16:39:45 +0000, "Ronald Natalie" ron at ronnatalie.com wrote: > > Subject: [TUHS] Re: "Imake: an obsolete build tool" -- anybody got this? > > > We eventually switched > > to CMAKE. > > > Yikes. That's not what I would call an improvement! > > I think CMake is about as anti-Unix (in philosophy) as you can get. > > At least Imake had a clear design and simple basic concept, and in its > basic form it could be highly performant. It was simple tool for using > other tools in an innovative way. As used by X11 though it was part of > what might best be described as a dog's breakfast, but that's only > really because the X11 macros kind of grew by committee and had an > extraordinary variety of target systems and source code to contend with. > > CMake has ended up with what I would consider an even uglier and more > unpalatable dog's breakfast. It also abuses other tools, requires hours > to compile on otherwise usable platforms, and wastes CPU and memory like > nothing else. > > -- > Greg A. Woods gwoods at acm.org > > > Kelowna, BC +1 250 762-7675 RoboHack woods at robohack.ca > > Planix, Inc. woods at planix.com Avoncote Farms woods at avoncote.ca CMake does seem to be losing some ground to the meson/ninja combination these days. I haven't used any of these beyond autotools, but the most complex stuff I've built has only depended on Xlib and OpenGL. I just keep an LDFLAGS_ up at the top of my makefiles these days and if I need to target such platform, add $(LDFLAGS_) to my main LDFLAGS definition. Same with CFLAGS and other such variables. Maybe not tenable for a huge sprawling project, but in my case, I'd much rather edit a line or two of a Makefile at build time than maintain the whole added dependency of some specialized build system. Admittedly there are simple little projects I see sometimes with something like autotools draped over the top and it feels like a solution in want of a problem. I put more effort into ensuring things like pkg-config are happy than I would just adding -L/opt/lib to an LDFLAGS variable... - Matt G. From woods at robohack.ca Wed Mar 19 05:29:10 2025 From: woods at robohack.ca (Greg A. Woods) Date: Tue, 18 Mar 2025 12:29:10 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <20250318190909.c6z4s4srr5hfrwmc@illithid> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> <20250318190909.c6z4s4srr5hfrwmc@illithid> Message-ID: To turn this around to a more positive direction.... Imake, and in particular the way it was used in X was of course a product of its times -- it evolved in a world with much more limited tools to build on, and in a world with far more variety in terms of target systems to support. As it was it did a remarkably good job of configuring and building X11 for supported systems, and adding support for a typical "modern" (at the time) Unix-like system was actually quite easy. If PMake (now called BSD Make) had been fully fledged a wee bit earlier it could have done the job of Imake (as well as old make at the same time of course, without the rickety help of cpp), and perhaps have even allowed the creation of a somewhat more elegant set of macros to support the full range of X11 and its target systems. It would even have allowed for parallel builds of what was at the time probably one of the larger and more complex system components around. Modern GNU Make has similar (though some of us would argue more limited and less elegant) features that might have sufficed for X11, but it was also too late to the game I think. Today NetBSD supports a BSD Make build system for the full Xorg suite including Xserver which works quite well, albeit only directly supporting one target system in its current form. -- Greg A. Woods Kelowna, BC +1 250 762-7675 RoboHack Planix, Inc. Avoncote Farms -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 195 bytes Desc: OpenPGP Digital Signature URL: From ron at ronnatalie.com Wed Mar 19 05:39:39 2025 From: ron at ronnatalie.com (Ron Natalie) Date: Tue, 18 Mar 2025 19:39:39 +0000 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <20250318190909.c6z4s4srr5hfrwmc@illithid> References: <20250318160358.oi-Yxc_r@steffen%sdaoden.eu> <95e2b2eb-4830-4fe1-9492-65bc53d3f297@oracle.com> <20250318190909.c6z4s4srr5hfrwmc@illithid> Message-ID: I didn’t much care for it myself (and fortunately, it wasn’t my job to maintain the build production). It at least gave us common tools for Windoze and UNIX. ------ Original Message ------ >From "G. Branden Robinson" To tuhs at tuhs.org Date 3/18/2025 3:09:09 PM Subject [TUHS] Re: "Imake: an obsolete build tool" -- anybody got this? >At 2025-03-18T12:03:48-0700, Greg A. Woods wrote: >> At Tue, 18 Mar 2025 16:39:45 +0000, "Ronald Natalie" wrote: >> Subject: [TUHS] Re: "Imake: an obsolete build tool" -- anybody got this? >> > >> > We eventually switched to CMAKE. >> >> Yikes. That's not what I would call an improvement! >> >> I think CMake is about as anti-Unix (in philosophy) as you can get. >> >> At least Imake had a clear design and simple basic concept, and in its >> basic form it could be highly performant. It was simple tool for using >> other tools in an innovative way. As used by X11 though it was part of >> what might best be described as a dog's breakfast, but that's only >> really because the X11 macros kind of grew by committee and had an >> extraordinary variety of target systems and source code to contend with. >> >> CMake has ended up with what I would consider an even uglier and more >> unpalatable dog's breakfast. It also abuses other tools, requires hours >> to compile on otherwise usable platforms, and wastes CPU and memory like >> nothing else. > >I believe the usual rejoinder here is "worse is better". > >My experiences with CMake are not happy ones. > >Regards, >Branden From dave at horsfall.org Wed Mar 19 06:05:55 2025 From: dave at horsfall.org (Dave Horsfall) Date: Wed, 19 Mar 2025 07:05:55 +1100 (AEDT) Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> References: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> Message-ID: <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> On Tue, 18 Mar 2025, Chet Ramey via TUHS wrote: > > also, related, I did find the original, wonderful, paper about > > writing hello, world in x11: it was a compressed .ps; it > > uncompressed fine but almost nothing I have could work with the .ps. > > I have a pdf of that paper. About five pages long, as I recall... Pretty please? -- Dave From tuhs at tuhs.org Wed Mar 19 06:40:51 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Tue, 18 Mar 2025 16:40:51 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> References: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> Message-ID: On 3/18/25 4:05 PM, Dave Horsfall wrote: > On Tue, 18 Mar 2025, Chet Ramey via TUHS wrote: > >>> also, related, I did find the original, wonderful, paper about >>> writing hello, world in x11: it was a compressed .ps; it >>> uncompressed fine but almost nothing I have could work with the .ps. >> >> I have a pdf of that paper. > > About five pages long, as I recall... 14. Rosenthal, D., “A Simple X11 Client Program,” USENIX Winter Conference Proceedings, pp. 229-242, Dallas, 1988 (the subtitle is "How hard can it really be to write "Hello, World"?"). https://archive.org/details/1988-proceedings-winter-tech-dallas If you agree that's the one we're talking about, I'll send it to Warren. Chet -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From rminnich at gmail.com Wed Mar 19 07:04:13 2025 From: rminnich at gmail.com (ron minnich) Date: Tue, 18 Mar 2025 14:04:13 -0700 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> Message-ID: that's the one I have. On Tue, Mar 18, 2025 at 1:41 PM Chet Ramey via TUHS wrote: > On 3/18/25 4:05 PM, Dave Horsfall wrote: > > On Tue, 18 Mar 2025, Chet Ramey via TUHS wrote: > > > >>> also, related, I did find the original, wonderful, paper about > >>> writing hello, world in x11: it was a compressed .ps; it > >>> uncompressed fine but almost nothing I have could work with the .ps. > >> > >> I have a pdf of that paper. > > > > About five pages long, as I recall... > > 14. > > Rosenthal, D., “A Simple X11 Client Program,” USENIX Winter Conference > Proceedings, pp. 229-242, Dallas, 1988 > > (the subtitle is "How hard can it really be to write "Hello, World"?"). > > https://archive.org/details/1988-proceedings-winter-tech-dallas > > If you agree that's the one we're talking about, I'll send it to Warren. > > Chet > > -- > ``The lyf so short, the craft so long to lerne.'' - Chaucer > ``Ars longa, vita brevis'' - Hippocrates > Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Wed Mar 19 07:12:52 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Tue, 18 Mar 2025 21:12:52 +0000 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> Message-ID: On Tuesday, March 18th, 2025 at 1:40 PM, Chet Ramey via TUHS wrote: > On 3/18/25 4:05 PM, Dave Horsfall wrote: > > > On Tue, 18 Mar 2025, Chet Ramey via TUHS wrote: > > > > > > also, related, I did find the original, wonderful, paper about > > > > writing hello, world in x11: it was a compressed .ps; it > > > > uncompressed fine but almost nothing I have could work with the .ps. > > > > > > I have a pdf of that paper. > > > > About five pages long, as I recall... > > > 14. > > Rosenthal, D., “A Simple X11 Client Program,” USENIX Winter Conference > Proceedings, pp. 229-242, Dallas, 1988 > > (the subtitle is "How hard can it really be to write "Hello, World"?"). > > https://archive.org/details/1988-proceedings-winter-tech-dallas > > If you agree that's the one we're talking about, I'll send it to Warren. > > Chet > > -- > `The lyf so short, the craft so long to lerne.'' - Chaucer` Ars longa, vita brevis'' - Hippocrates > Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ One thing this article doesn't address in my mind is what precisely is a given user trying to achieve with a "Hello, World" application. On one hand is the "Does my environment function" role in which I think their application may have been a bit heavy handed for. In my mind this would amount to: - Retrieve the display - Create a window in it - Map the window - Loop to allow it to display - Unmap the window - Cleanup With looping to display in particular being based on some non-specific termination that could be as simple as a timer/countdown. After all, all I want to know is can I build and run something targeting Xlib, all the rest is then can I use specific feature of Xlib , for instance, displaying text. Even polling for keyboard events to perhaps catch an Esc press isn't really relevant to the question of will a window map, display in the event loop, then unmap when the event loop is terminated. On the other hand, a "Hello, World" is often used as the least common denominator application that allows text-based feedback to a user, thereby facilitating using that display mechanism for feedback throughout ongoing development. For instance, my minimal "Hello, World" for microcontrollers tends to be just verifying I can drop a single character in the transmitter of a UART and get that character back on the connected tty. Once I have such a putc, I have my first debug mechanism. Not intended as critique either, just some philosophical questions that arose thinking about "what do I want to achieve" regarding a classic "Hello, World" program. They do touch on this in the article too but I think there is a tad bit lost comparing the Xlib implementation to the use of printf. To quote: > These examples demonstrate that programming > applications using only the basic X library interface is > even more difficult and unrewarding than programming > UNIX applications using only the system call interface. But some of the efficacy of this mention is lost without the comparison being between "Hello, World" using nothing but syscalls. For instance all the matters of getting the graphics context correct, color space, resizing, etc. sound like concerns comparable to those far removed from a printf() user by the tty driver, things like ioctl(), etc. Nice read though. Obviously they touched on a philosophical question that can continue to be kicked around in all sorts of ways. - Matt G. From davida at pobox.com Wed Mar 19 21:15:03 2025 From: davida at pobox.com (David Arnold) Date: Wed, 19 Mar 2025 22:15:03 +1100 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: <0930200B-5D9C-485E-A667-53111EB5F42A@pobox.com> > On 19 Mar 2025, at 04:24, ron minnich wrote: > > no idea why apple preview won't just open .ps files any more, I thought that used to work. It did. Preview.app used to support PostScript (and Encapsulated PostScript) but it was dropped in Ventura. See https://support.apple.com/en-us/108775 d -------------- next part -------------- An HTML attachment was scrubbed... URL: From shadow at gmail.com Wed Mar 19 22:32:05 2025 From: shadow at gmail.com (Daria Phoebe Brashear) Date: Wed, 19 Mar 2025 08:32:05 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: Message-ID: On Tue, Mar 18, 2025 at 8:49 PM ron minnich wrote: > > I'm having a hard time finding the reference, and I wonder if it was maybe in ;login: > > "imake: An Obsolete Build Tool" by Paul W. Richards > Which paper has this in its bibliography? -- Daria Phoebe Brashear AuriStor, Inc dariaphoebe.com From tuhs at tuhs.org Wed Mar 19 23:22:56 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Wed, 19 Mar 2025 09:22:56 -0400 Subject: [TUHS] "Imake: an obsolete build tool" -- anybody got this? In-Reply-To: References: <4b6f480d-e410-4fb3-a2e1-0f088a377536@case.edu> <826dba04-ace9-7ae5-86a8-cc84c0be57b4@horsfall.org> Message-ID: <4ed173ce-b9da-4e04-8284-a3a5831910fd@case.edu> On 3/18/25 5:12 PM, segaloco via TUHS wrote: > One thing this article doesn't address in my mind is what precisely is a given > user trying to achieve with a "Hello, World" application. I think it's a statement on complexity, some of it necessary, some not. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From sjenkin at canb.auug.org.au Wed Mar 26 20:16:11 2025 From: sjenkin at canb.auug.org.au (sjenkin at canb.auug.org.au) Date: Wed, 26 Mar 2025 21:16:11 +1100 Subject: [TUHS] Moore's Law: PDP-11 1970 vs TI's low-power small chip for under 20 cents. Message-ID: <99ED970D-DB3B-4267-9EDC-763CB2EA4F73@canb.auug.org.au> The specs aren’t that quite equivalent: 1Kb SRAM vs 48K core memory, 32-bit vs 16-bit CPU and 24Mhz vs 1MHz, but what does that clock mean in MIPS? Anyone want to take a stab at the Moore’s Law constant over 55 years, given the specs non-equivalence? ~2.5M times price change, = 2^22 times. ===================== In 1970, they proposed buying a PDP-11 for about $65,000. [ $535,000 in 2025 according to US Inflation Calculator ] ===================== Texas Instruments Introduces MSPM0C1104 as the Smallest Available Microcontroller Measuring only 1.38mm², this wafer chip-scale package MCU is 38% smaller than existing alternatives. The MSPM0C1104 includes a 24MHz Arm Cortex-M0+ core (32-bit), 16KB of flash memory, 1KB of SRAM, a 12-bit ADC with three channels, and six GPIO pins. It also supports standard communication interfaces, including UART, SPI, and I2C. Additional features include 5V-tolerant I/Os, a 1-channel DMA controller, a CRC-16 accelerator, and various timers, including a 16-bit advanced timer and two 16-bit general-purpose timers. These MCUs operate in an extended temperature range from -40°C to 125°C and support supply voltages from 1.62V to 3.6V. The MSPM0 series starts at $0.16 in 1,000-unit quantities, with multiple configurations available. ===================== -- Steve Jenkin, IT Systems and Design 0412 786 915 (+61 412 786 915) PO Box 38, Kippax ACT 2615, AUSTRALIA mailto:sjenkin at canb.auug.org.au http://members.tip.net.au/~sjenkin From beebe at math.utah.edu Fri Mar 28 01:55:08 2025 From: beebe at math.utah.edu (Nelson H. F. Beebe) Date: Thu, 27 Mar 2025 09:55:08 -0600 Subject: [TUHS] Marc Rochkind's article on SCCS has been published Message-ID: The March 2025 issue of an IEEE journal has published Marc Rochkind's article on SCCS. TUHS list members discussed a draft version of the article last fall. Here is its BibTeX entry: @String{j-IEEE-TRANS-SOFTW-ENG = "IEEE Transactions on Software Engineering"} @Article{Rochkind:2025:RSC, author = "Marc J. Rochkind", title = "A Retrospective on the {Source Code Control System}", journal = j-IEEE-TRANS-SOFTW-ENG, volume = "51", number = "3", pages = "695--699", month = mar, year = "2025", CODEN = "IESEDJ", DOI = "https://doi.org/10.1109/TSE.2024.3524947", ISSN = "0098-5589 (print), 1939-3520 (electronic)", ISSN-L = "0098-5589", bibdate = "Tue Mar 25 08:57:56 2025", bibsource = "https://www.math.utah.edu/pub/tex/bib/ieeetranssoftweng2020.bib", acknowledgement = ack-nhfb, ajournal = "IEEE Trans. Softw. Eng.", fjournal = "IEEE Transactions on Software Engineering", journal-URL = "https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=32", keywords = "Codes; Control systems; CSSC; Mainframes; Merging; Programming; SCCS; Software; software configuration management; Software development management; software engineering; Software engineering; software reliability; Software reliability; software tools; Source coding; source control management; version control systems", } ------------------------------------------------------------------------------- - Nelson H. F. Beebe Tel: +1 801 581 5254 - - University of Utah - - Department of Mathematics, 110 LCB Internet e-mail: beebe at math.utah.edu - - 155 S 1400 E RM 233 beebe at acm.org beebe at computer.org - - Salt Lake City, UT 84112-0090, USA URL: https://www.math.utah.edu/~beebe - ------------------------------------------------------------------------------- From tuhs at tuhs.org Fri Mar 28 02:08:57 2025 From: tuhs at tuhs.org (segaloco via TUHS) Date: Thu, 27 Mar 2025 16:08:57 +0000 Subject: [TUHS] Marc Rochkind's article on SCCS has been published In-Reply-To: References: Message-ID: On Thursday, March 27th, 2025 at 8:55 AM, Nelson H. F. Beebe wrote: > The March 2025 issue of an IEEE journal has published Marc Rochkind's > article on SCCS. TUHS list members discussed a draft version of the > article last fall. Here is its BibTeX entry: > > @String{j-IEEE-TRANS-SOFTW-ENG = "IEEE Transactions on Software Engineering"} > > @Article{Rochkind:2025:RSC, > author = "Marc J. Rochkind", > title = "A Retrospective on the {Source Code Control System}", > journal = j-IEEE-TRANS-SOFTW-ENG, > volume = "51", > number = "3", > pages = "695--699", > month = mar, > year = "2025", > CODEN = "IESEDJ", > DOI = "https://doi.org/10.1109/TSE.2024.3524947", > ISSN = "0098-5589 (print), 1939-3520 (electronic)", > ISSN-L = "0098-5589", > bibdate = "Tue Mar 25 08:57:56 2025", > bibsource = "https://www.math.utah.edu/pub/tex/bib/ieeetranssoftweng2020.bib", > acknowledgement = ack-nhfb, > ajournal = "IEEE Trans. Softw. Eng.", > fjournal = "IEEE Transactions on Software Engineering", > journal-URL = "https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=32", > keywords = "Codes; Control systems; CSSC; Mainframes; Merging; > Programming; SCCS; Software; software configuration > management; Software development management; software > engineering; Software engineering; software > reliability; Software reliability; software tools; > Source coding; source control management; version > control systems", > } > > ------------------------------------------------------------------------------- > - Nelson H. F. Beebe Tel: +1 801 581 5254 - > - University of Utah - > - Department of Mathematics, 110 LCB Internet e-mail: beebe at math.utah.edu - > - 155 S 1400 E RM 233 beebe at acm.org beebe at computer.org - > - Salt Lake City, UT 84112-0090, USA URL: https://www.math.utah.edu/~beebe - > ------------------------------------------------------------------------------- Congrats on publication Marc, and thanks Nelson as always for such meticulous bibliography work! - Matt G. From douglas.mcilroy at dartmouth.edu Fri Mar 28 03:35:16 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Thu, 27 Mar 2025 13:35:16 -0400 Subject: [TUHS] non-Posix input files Message-ID: >> info groff gives semantics for including nonempty files that don't end >> with newline. Such files violate the Posix definition of text file. >> >> Although groff is certainly justified in providing semantics for >> non-Posix text, I suggest that it should warn when it does so. > That's true but I'm hesitant to put groff in the business of wagging its > finger at users feeding it non-strictly-conforming text files when doing > so doesn't cause it any problems. Causing groff problems is an odd criterion. The fact that groff will paste files together unless the first happens to end in a newline is a sign of groff 's internals, not of the underlying problem. A newline missing at the end of a file is typically a symptom of either the incaution of some other program (perhaps an editor) or of a file having been unexpectedly truncated (as by a program abort). The latter cause is common enough to justify warning always, not just about cases that are inconvenient to groff. Groff is what it is, but if the treatment of absent final newlines were up for grabs, I'd argue for the more common solution: in all cases insert a newline and warn. Doug From clemc at ccc.com Fri Mar 28 04:45:01 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 27 Mar 2025 14:45:01 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: Message-ID: Branden Robin wrote: >> info groff gives semantics for including nonempty files that don't end > >> with newline. Such files violate the Posix definition of text file. Not so fast. POSIX does not define a text file (that's an ANSI C-ism - thank you DOS -- more in minute). Even in the original POSIX definition we were very careful to >>never<< do that abomination. What the current POSIX specs says is: 3.164 File An object that can be written to, or read from, or both. A file has certain attributes, including access permissions and type. File types include regular file, character special file, block special file, FIFO special file, symbolic link, socket, and directory. Other types of files may be supported by the implementation. .... 3.317 Regular File A file that is a randomly accessible sequence of bytes, with no further structure imposed by the system. In fact, nowhere in the term "text file" defined (or used — search for it) in the POSIX standard. The problem is that at the same time, ANSI C was being defined in the early 1980s The DOS weenies wanted to support the silly two characters line terminations and old notion of a structured "text file" which PC/MS-DOS had inherited from the DEC and IBM OS's of 1960s and what IBM has called "access methods" in the old days (in UNIX everything is just a stream of bytes thank you or in Multics, everything is segment). The UNIX folks on the C committee were very much against adding the support — which Lattice C for MS-DOS had added at the time but happened to be the most popular C compiler for the DOS target and the number of user of the compiler was growing extremely rapidly. They were also pushing for things like the near and far keywords. It was quite a fight in the C community. (It was one of the additions that caused Dennis to quip: *"**When I read commentary about suggestions for where C should go, I often think back and give thanks that it wasn't developed under the advice of a worldwide crowd."* Anyway, the UNIX folk had to decide which eveils the DOS folks were pushes they could live with and a compromise was the creation of the new "rb" "wb" "ab" crud in the fopen(3) call since we you use "r", "w", "b" as because POSIX allows UNIX to open a what ANSI C called a "text file" since there no such thing under POSIX. But in the ANSI C spec (which, remember, is >>only<< for C source code), it says: 5.2.1 Character sets .... The representation of each member of the source and execution basic character sets shall fit in a byte. In both the source and execution basic character sets, the value of each character after 0 in the above list of decimal digits shall be one greater than the value of the previous. In source files, there shall be some way of indicating the end of each line of text; this International Standard treats such an end-of-line indicator as if it were a single new-line character. In the basic execution character set, there shall be control characters representing alert, backspace, carriage return, and new line. If any other characters are encountered in a source file (except in an identifier, a character constant, a string literal, a header name, a comment, or a preprocessing token that is never converted to a token), the behavior is undefined. The key point in the highlighted text in ANSI C is that it does not *require it*. A problem in the ANSI C standard is that does >>use<< the term "text file" although ANSI C never really defines what one is (and probably could not because of the fight - UNIX would have said -- nope -- no structure required. ANSI C does say in: 7.21.2 Streams ... Environmental limits An implementation shall support text files with lines containing at least 254 characters, including the terminating new-line character. The value of the macro BUFSIZ shall be at least 256. The whole compromise was that you did not have to use termination of type. If the concept of a 'textfile" is local to the implementation. UNIX folks get to keep doing things as they did before, so if you used fopen with "r" or "w" its up to the program to worry about the format of the file "access modes" from the 1960s were not needed. ᐧ -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Fri Mar 28 04:52:59 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Thu, 27 Mar 2025 14:52:59 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: Message-ID: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> On 3/27/25 2:45 PM, Clem Cole wrote: > > Branden Robin wrote: > > > >> info groff gives semantics for including nonempty files that don't end > >> with newline. Such files violate the Posix definition of text file. > > > Not so fast.   POSIX does not define a text file (that's an ANSI C-ism - > thank you DOS -- more in minute). It does. "3.387 Text File A file that contains characters organized into zero or more lines. The lines do not contain NUL characters and none can exceed {LINE_MAX} bytes in length, including the character. Although POSIX.1-2024 does not distinguish between text files and binary files (see the ISO C standard), many utilities only produce predictable or meaningful output when operating on text files. The standard utilities that have such restrictions always specify "text files" in their STDIN or INPUT FILES sections." https://pubs.opengroup.org/onlinepubs/9799919799/basedefs/V1_chap03.html#tag_03_387 A line is defined as: "3.185 Line A sequence of zero or more non- characters plus a terminating character." https://pubs.opengroup.org/onlinepubs/9799919799/basedefs/V1_chap03.html#tag_03_185 -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From douglas.mcilroy at dartmouth.edu Fri Mar 28 04:53:58 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Thu, 27 Mar 2025 14:53:58 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: Message-ID: I think you're looking at the wrong Posix document. Posix Shell and Utilities (at least my ancient copy) says 2.2.2.181 Text file: A file that contains one or more lines. The Lines shall not contain NUL characters ... Doug On Thu, Mar 27, 2025 at 2:45 PM Clem Cole wrote: > > > Branden Robin wrote: > > >> >> info groff gives semantics for including nonempty files that don't end >> >> with newline. Such files violate the Posix definition of text file. > > > Not so fast. POSIX does not define a text file (that's an ANSI C-ism - thank you DOS -- more in minute). > > Even in the original POSIX definition we were very careful to >>never<< do that abomination. What the current POSIX specs says is: > > 3.164 File > An object that can be written to, or read from, or both. A file has certain attributes, including access permissions and type. File types include regular file, character special file, block special file, FIFO special file, symbolic link, socket, and directory. Other types of files may be supported by the implementation. > > .... > > 3.317 Regular File > A file that is a randomly accessible sequence of bytes, with no further structure imposed by the system. > > In fact, nowhere in the term "text file" defined (or used — search for it) in the POSIX standard. The problem is that at the same time, ANSI C was being defined in the early 1980s The DOS weenies wanted to support the silly two characters line terminations and old notion of a structured "text file" which PC/MS-DOS had inherited from the DEC and IBM OS's of 1960s and what IBM has called "access methods" in the old days (in UNIX everything is just a stream of bytes thank you or in Multics, everything is segment). The UNIX folks on the C committee were very much against adding the support — which Lattice C for MS-DOS had added at the time but happened to be the most popular C compiler for the DOS target and the number of user of the compiler was growing extremely rapidly. They were also pushing for things like the near and far keywords. It was quite a fight in the C community. (It was one of the additions that caused Dennis to quip: "When I read commentary about suggestions for where C should go, I often think back and give thanks that it wasn't developed under the advice of a worldwide crowd." > > Anyway, the UNIX folk had to decide which eveils the DOS folks were pushes they could live with and a compromise was the creation of the new "rb" "wb" "ab" crud in the fopen(3) call since we you use "r", "w", "b" as because POSIX allows UNIX to open a what ANSI C called a "text file" since there no such thing under POSIX. > > But in the ANSI C spec (which, remember, is >>only<< for C source code), it says: > > 5.2.1 Character sets > .... The representation of each member of the source and execution basic character sets shall fit in a byte. > In both the source and execution basic character sets, the value of each character after 0 in the above > list of decimal digits shall be one greater than the value of the previous. In source files, there shall > be some way of indicating the end of each line of text; this International Standard treats such an > end-of-line indicator as if it were a single new-line character. In the basic execution character set, > there shall be control characters representing alert, backspace, carriage return, and new line. If any > other characters are encountered in a source file (except in an identifier, a character constant, a string > literal, a header name, a comment, or a preprocessing token that is never converted to a token), the > behavior is undefined. > > The key point in the highlighted text in ANSI C is that it does not require it. A problem in the ANSI C standard is that does >>use<< the term "text file" although ANSI C never really defines what one is (and probably could not because of the fight - UNIX would have said -- nope -- no structure required. ANSI C does say in: > > 7.21.2 Streams > ... > Environmental limits > An implementation shall support text files with lines containing at least 254 characters, including > the terminating new-line character. The value of the macro BUFSIZ shall be at least 256. > > The whole compromise was that you did not have to use termination of type. If the concept of a 'textfile" is local to the implementation. UNIX folks get to keep doing things as they did before, so if you used fopen with "r" or "w" its up to the program to worry about the format of the file "access modes" from the 1960s were not needed. > ᐧ From clemc at ccc.com Fri Mar 28 05:00:15 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 27 Mar 2025 15:00:15 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> Message-ID: Argh -- I standard corrected. We worked hard at the beginning to keep that crap out -- sigh. But at least is does says: POSIX.1-2024 *does not **distinguish between text files and binary files* (see the ISO C standard) So my point is the same: don't blame POSIX- this is an ANSI C issue, and utilities really should not care. ᐧ On Thu, Mar 27, 2025 at 2:53 PM Chet Ramey wrote: > On 3/27/25 2:45 PM, Clem Cole wrote: > > > > Branden Robin wrote: > > > > > > >> info groff gives semantics for including nonempty files that > don't end > > >> with newline. Such files violate the Posix definition of text > file. > > > > > > Not so fast. POSIX does not define a text file (that's an ANSI C-ism - > > thank you DOS -- more in minute). > > It does. > > "3.387 Text File > > A file that contains characters organized into zero or more lines. The > lines do not contain NUL characters and none can exceed {LINE_MAX} bytes in > length, including the character. Although POSIX.1-2024 does not > distinguish between text files and binary files (see the ISO C standard), > many utilities only produce predictable or meaningful output when > operating > on text files. The standard utilities that have such restrictions always > specify "text files" in their STDIN or INPUT FILES sections." > > > https://pubs.opengroup.org/onlinepubs/9799919799/basedefs/V1_chap03.html#tag_03_387 > > A line is defined as: > > "3.185 Line > > A sequence of zero or more non- characters plus a terminating > character." > > > https://pubs.opengroup.org/onlinepubs/9799919799/basedefs/V1_chap03.html#tag_03_185 > > -- > ``The lyf so short, the craft so long to lerne.'' - Chaucer > ``Ars longa, vita brevis'' - Hippocrates > Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Fri Mar 28 05:05:52 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Thu, 27 Mar 2025 15:05:52 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> Message-ID: <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> On 3/27/25 3:00 PM, Clem Cole wrote: > Argh -- I standard corrected.  We worked hard at the beginning to keep that > crap out -- sigh. > > But  at least is does says: POSIX.1-2024 /_does not _//_distinguish between > text files and binary files_/ (see the ISO C standard) It also says "The standard utilities that have such restrictions always specify "text files" in their STDIN or INPUT FILES sections," so you can't avoid it. awk is one such utility (sh is not). This is an application requirement, so awk is required to add a newline at the end of a file that does not have one. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From clemc at ccc.com Fri Mar 28 05:35:05 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 27 Mar 2025 15:35:05 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: Chet - as I said, we tried so hard to keep that kind of crap out. Dennis was right. FWIW: with UNIX (POSIX), input will end if an EOF and the ANSI C untilies will obey it or a newline, so you can write the code to work fine either way. But that's a choice of implementation/what subroutines - how you think about the data. I'll accept that that is what the words say in the >>awk<< specification document, but as one of the original authors of the first UNIX standard and the later POSIX standard I can say we tried hard to mak sure we got it right and follow the idea: A regular file has no structure and never to allow the standard to impose it. I think the core standard still says that, and the basic idea is unchanged. The actual structure of the input file is an application idea, not a UNIX/POSIX defined idea. The issue here is the term POSIX. Do you mean it to be the kernel (.1) and if a >>specific<< application with .2 (the C compiler itself, awk, ed) which might put structure onto the file and that's fine. The >>OS<< does not set the structure — it is done by something else. I understand having the application do it; I wish it did not. Many applications (even text editors) can (and have) been written without needing one specific structure, which is my point. I also accept that the folks who took over the standard in the name of "progress" changed (relaxed) much of what we worked so hard to avoid, knowing there were dragons - particularly WRT to textual information. We really did not want to repeat the errors of the 1960s. *i.e.*, as George Santayana originally wrote, *“Those who cannot remember the past are condemned to repeat it.” * ᐧ On Thu, Mar 27, 2025 at 3:05 PM Chet Ramey wrote: > On 3/27/25 3:00 PM, Clem Cole wrote: > > Argh -- I standard corrected. We worked hard at the beginning to keep > that > > crap out -- sigh. > > > > But at least is does says: POSIX.1-2024 /_does not _//_distinguish > between > > text files and binary files_/ (see the ISO C standard) > > It also says "The standard utilities that have such restrictions always > specify "text files" in their STDIN or INPUT FILES sections," so you can't > avoid it. > > awk is one such utility (sh is not). This is an application requirement, so > awk is required to add a newline at the end of a file that does not have > one. > > -- > ``The lyf so short, the craft so long to lerne.'' - Chaucer > ``Ars longa, vita brevis'' - Hippocrates > Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglas.mcilroy at dartmouth.edu Fri Mar 28 05:50:50 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Thu, 27 Mar 2025 15:50:50 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: It seems my reply to Clem went astray. POSIX Part 2, Shell and Utilities, is very clear: 2..2.2.181 text file: A File that contains characters organized into one or more lines. The lines shall not contain NUL characters ... 2.2.2.95 line: A sequence of zero or more non- characters plus a terminating character, Oddly--and in my opinion wrongly--the standard excludes empty files. It would be a shock if an editor or groff refused to process an empty file and thereby broke Kernighan's law, "Do nothing gracefully". Doug On Thu, Mar 27, 2025 at 3:35 PM Clem Cole wrote: > > Chet - as I said, we tried so hard to keep that kind of crap out. Dennis was right. FWIW: with UNIX (POSIX), input will end if an EOF and the ANSI C untilies will obey it or a newline, so you can write the code to work fine either way. But that's a choice of implementation/what subroutines - how you think about the data. > > I'll accept that that is what the words say in the >>awk<< specification document, but as one of the original authors of the first UNIX standard and the later POSIX standard I can say we tried hard to mak sure we got it right and follow the idea: A regular file has no structure and never to allow the standard to impose it. I think the core standard still says that, and the basic idea is unchanged. The actual structure of the input file is an application idea, not a UNIX/POSIX defined idea. > > The issue here is the term POSIX. Do you mean it to be the kernel (.1) and if a >>specific<< application with .2 (the C compiler itself, awk, ed) which might put structure onto the file and that's fine. The >>OS<< does not set the structure — it is done by something else. > > I understand having the application do it; I wish it did not. Many applications (even text editors) can (and have) been written without needing one specific structure, which is my point. I also accept that the folks who took over the standard in the name of "progress" changed (relaxed) much of what we worked so hard to avoid, knowing there were dragons - particularly WRT to textual information. We really did not want to repeat the errors of the 1960s. i.e., as George Santayana originally wrote, “Those who cannot remember the past are condemned to repeat it.” > ᐧ > > On Thu, Mar 27, 2025 at 3:05 PM Chet Ramey wrote: >> >> On 3/27/25 3:00 PM, Clem Cole wrote: >> > Argh -- I standard corrected. We worked hard at the beginning to keep that >> > crap out -- sigh. >> > >> > But at least is does says: POSIX.1-2024 /_does not _//_distinguish between >> > text files and binary files_/ (see the ISO C standard) >> >> It also says "The standard utilities that have such restrictions always >> specify "text files" in their STDIN or INPUT FILES sections," so you can't >> avoid it. >> >> awk is one such utility (sh is not). This is an application requirement, so >> awk is required to add a newline at the end of a file that does not have >> one. >> >> -- >> ``The lyf so short, the craft so long to lerne.'' - Chaucer >> ``Ars longa, vita brevis'' - Hippocrates >> Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ From g.branden.robinson at gmail.com Fri Mar 28 06:02:57 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Thu, 27 Mar 2025 15:02:57 -0500 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: <20250327200257.ftdk2rw5rz2zqtcp@illithid> [CC updated] Hi folks, This thread started on the groff at gnu list, so I'm looping it back in. https://lists.gnu.org/archive/html/groff/2025-02/msg00088.html https://lists.gnu.org/archive/html/groff/2025-03/msg00070.html The formatter, GNU troff: * catenates its input into a single stream, disregarding any file "boundaries" except the first and the last; * does not generally require the input to end with a newline; * throws a warning diagnostic if the input ends with the *roff escape character (and ignores that character); * warns about and discards NUL characters from the input (along with several others that are invalid as input); * always terminates its _output_ files (whether in Kernighan's "trout" format [originally seen in CSTR #97], or "approximate" [`troff -a`] format) with newlines (ASCII LF); * and writes only printable ASCII characters and the space in "trout"/"grout" or approximate outputs, with one esoteric exception.[1] I'd like to know what, specifically, this program should be doing differently. Regards, Branden [1] The `cf` request permits the copying of abritrary bytes from the input file named by it to "grout" output. In the forthcoming _groff_ 1.24.0, the formatter handles this request only in the non-default "unsafe mode" enabled by the `-U` command-line option. Quoting groff's Texinfo manual: -- Request: .trf ["]file -- Request: .cf ["]file Copy FILE as "throughput" to GNU 'troff' output. Each line of FILE is output as if preceded by '\!', but is not interpreted by the formatter. If FILE does not end with a newline, 'trf' appends one. Both requests break the line before reading FILE, unless invoked with the no-break control character. When used in a diversion, these requests embed a node (*note Gtroff Internals::) in it that, when interpolated, causes the contents of FILE to be copied to the output. In AT&T 'troff', 'cf' does so even if a diversion is in use; this behavior is so anomalous that it must be considered a bug. 'cf' copies the contents of FILE completely unprocessed; it is therefore an error to use this request in safer mode, which is the default. Invoke GNU 'troff' or a front end with the '-U' option to enable unsafe mode. 'trf' discards invalid input characters; recall *note Identifiers::. For 'cf', within a diversion, "completely unprocessed" means that each line of a file to be inserted is handled as if it were preceded by '\!\\!'. *Caution:* Use of these requests can put syntactically invalid content into the formatter's output, which 'groff''s output drivers then fail to process. One application is to pass instructions to a postprocessor that interprets FILE's contents and filters it out before sending it to the output device. -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From g.branden.robinson at gmail.com Fri Mar 28 06:08:19 2025 From: g.branden.robinson at gmail.com (G. Branden Robinson) Date: Thu, 27 Mar 2025 15:08:19 -0500 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: <20250327200819.vfehsui5lxpcpz67@illithid> Hi Doug, At 2025-03-27T15:50:50-0400, Douglas McIlroy wrote: > It seems my reply to Clem went astray. POSIX Part 2, Shell and > Utilities, is very clear: > > 2..2.2.181 text file: A File that contains characters organized into > one or more lines. > The lines shall not contain NUL characters ... > > 2.2.2.95 line: A sequence of zero or more non- characters > plus a terminating character, > > Oddly--and in my opinion wrongly--the standard excludes empty files. > It would be a shock if an editor or groff refused to process an empty > file and thereby broke Kernighan's law, "Do nothing gracefully". I think I can claim that GNU troff has long upheld that principle. $ printf '' | ~/groff-HEAD/bin/groff | grep . || echo NO OUTPUT NO OUTPUT $ printf '' | ~/groff-1.23.0/bin/groff | grep . || echo NO OUTPUT NO OUTPUT $ printf '' | ~/groff-1.22.4/bin/groff | grep . || echo NO OUTPUT NO OUTPUT $ printf '' | ~/groff-1.22.3/bin/groff | grep . || echo NO OUTPUT NO OUTPUT That carries us back through over 10 years of groff development. I'd bet it's been the case since 1991. Maybe earlier. Regards, Branden -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: not available URL: From tuhs at tuhs.org Fri Mar 28 06:09:25 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Thu, 27 Mar 2025 16:09:25 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> On 3/27/25 3:35 PM, Clem Cole wrote: > Chet  - as I said, we tried so hard to keep that kind of crap out. I believe you! Even the first published version of 1003.2, in 1992, says "The definition of text file has caused controversy." and discusses how the definition seemingly outlaws empty files. > I'll accept that that is what the words say in the >>awk<< specification > document, but as one of the original authors of the first UNIX standard and > the later POSIX standard I can say we tried hard to mak sure we got it > right and follow the idea: A regular file has no structure and never to > allow the standard to impose it. Sure. I think the point is that there are a bunch of utilities, like awk, bc, ed, sort, and so on, whose input is required to be a text file as defined by the standard. troff isn't in the standard, so Branden can do what he likes, and if he wants to accept text files as the standard defines them, then ok. > The issue here is the term POSIX.   Do you mean it to be the kernel (.1) > and if a >>specific<< application with .2  (the C compiler itself, awk, ed) There is no distinction like that any more. POSIX is all of it: shell, utilities, library functions, system calls. The standard guarantees that where an input file is required to be a text file it is explicitly mentioned, and it does that. > I understand having the application do it; I wish it did not. Sorry, jargon. The application is the thing that calls awk; awk is a utility (expressive shrug). So whatever calls awk is responsible for making sure the script and any input files are text files. awk can add a newline to the end of a script file that doesn't have one, so the grammar works right. It's also entitled to do whatever it wants, including throw an error, if the input file isn't a text file. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From mrochkind at gmail.com Fri Mar 28 08:03:12 2025 From: mrochkind at gmail.com (Marc Rochkind) Date: Thu, 27 Mar 2025 16:03:12 -0600 Subject: [TUHS] Marc Rochkind's article on SCCS has been published In-Reply-To: References: Message-ID: Thanks! I've been trying to find out if there's actually a paper copy of Transactions on Software Engineering, but have been unable to. Perhaps they went to all digital (PDF) a while ago? Does anyone know? Here's a direct link to the paper: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10821013 As Nelson mentioned, the TUHS list was very helpful as I was putting the paper together. Marc On Thu, Mar 27, 2025 at 10:09 AM segaloco via TUHS wrote: > On Thursday, March 27th, 2025 at 8:55 AM, Nelson H. F. Beebe < > beebe at math.utah.edu> wrote: > > > The March 2025 issue of an IEEE journal has published Marc Rochkind's > > article on SCCS. TUHS list members discussed a draft version of the > > article last fall. Here is its BibTeX entry: > > > > @String{j-IEEE-TRANS-SOFTW-ENG = "IEEE Transactions on Software > Engineering"} > > > > @Article{Rochkind:2025:RSC, > > author = "Marc J. Rochkind", > > title = "A Retrospective on the {Source Code Control System}", > > journal = j-IEEE-TRANS-SOFTW-ENG, > > volume = "51", > > number = "3", > > pages = "695--699", > > month = mar, > > year = "2025", > > CODEN = "IESEDJ", > > DOI = "https://doi.org/10.1109/TSE.2024.3524947", > > ISSN = "0098-5589 (print), 1939-3520 (electronic)", > > ISSN-L = "0098-5589", > > bibdate = "Tue Mar 25 08:57:56 2025", > > bibsource = " > https://www.math.utah.edu/pub/tex/bib/ieeetranssoftweng2020.bib", > > acknowledgement = ack-nhfb, > > ajournal = "IEEE Trans. Softw. Eng.", > > fjournal = "IEEE Transactions on Software Engineering", > > journal-URL = " > https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=32", > > keywords = "Codes; Control systems; CSSC; Mainframes; Merging; > > Programming; SCCS; Software; software configuration > > management; Software development management; software > > engineering; Software engineering; software > > reliability; Software reliability; software tools; > > Source coding; source control management; version > > control systems", > > } > > > > > ------------------------------------------------------------------------------- > > - Nelson H. F. Beebe Tel: +1 801 581 5254 - > > - University of Utah - > > - Department of Mathematics, 110 LCB Internet e-mail: > beebe at math.utah.edu - > > - 155 S 1400 E RM 233 beebe at acm.org beebe at computer.org - > > - Salt Lake City, UT 84112-0090, USA URL: > https://www.math.utah.edu/~beebe - > > > ------------------------------------------------------------------------------- > > Congrats on publication Marc, and thanks Nelson as always for such > meticulous > bibliography work! > > - Matt G. > -- Subscribe to my Photo-of-the-Week emails at my website mrochkind.com. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robpike at gmail.com Fri Mar 28 10:19:32 2025 From: robpike at gmail.com (Rob Pike) Date: Fri, 28 Mar 2025 11:19:32 +1100 Subject: [TUHS] non-Posix input files In-Reply-To: <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> Message-ID: What is a "character" according to POSIX? I suspect it's just a byte, which is a long-outdated definition, the char keyword notwithstanding. -rob -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Fri Mar 28 10:37:37 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 27 Mar 2025 20:37:37 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> Message-ID: POSIX.1 is clear - a regular file is a sequence of bytes and that is the only requirement. See my previous email where I give the section number which is very crisp. Sent from a handheld expect more typos than usual On Thu, Mar 27, 2025 at 8:19 PM Rob Pike wrote: > What is a "character" according to POSIX? I suspect it's just a byte, > which is a long-outdated definition, the char keyword notwithstanding. > > -rob > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From clemc at ccc.com Fri Mar 28 10:38:50 2025 From: clemc at ccc.com (Clem Cole) Date: Thu, 27 Mar 2025 20:38:50 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> Message-ID: I should have added that .1 does not talk about characters. That was left to ANSI C Sent from a handheld expect more typos than usual On Thu, Mar 27, 2025 at 8:37 PM Clem Cole wrote: > POSIX.1 is clear - a regular file is a sequence of bytes and that is the > only requirement. See my previous email where I give the section number > which is very crisp. > > Sent from a handheld expect more typos than usual > > > On Thu, Mar 27, 2025 at 8:19 PM Rob Pike wrote: > >> What is a "character" according to POSIX? I suspect it's just a byte, >> which is a long-outdated definition, the char keyword notwithstanding. >> >> -rob >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From tuhs at tuhs.org Fri Mar 28 11:02:39 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Thu, 27 Mar 2025 21:02:39 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <6e23237f-4fe8-409a-9583-783b571496d3@case.edu> Message-ID: <2dbaead9-2710-44ab-b307-3170f653be06@case.edu> On 3/27/25 8:19 PM, Rob Pike wrote: > What is a "character" according to POSIX? I suspect it's just a byte, which > is a long-outdated definition, the char keyword notwithstanding. "3.58 Character A sequence of one or more bytes representing a member of a character set. Note: This term corresponds to the ISO C standard term multi-byte character, where a single-byte character is a special case of a multi-byte character. Unlike the usage in the ISO C standard, character here has no necessary relationship with storage space, and byte is used when storage space is discussed. See the definition of the portable character set in 6.1 Portable Character Set for a further explanation of the graphical representations of (abstract) characters, as opposed to character encodings." https://pubs.opengroup.org/onlinepubs/9799919799/basedefs/V1_chap03.html#tag_03_58 -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From arnold at skeeve.com Fri Mar 28 17:22:04 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Fri, 28 Mar 2025 01:22:04 -0600 Subject: [TUHS] non-Posix input files In-Reply-To: <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> Message-ID: <202503280722.52S7M4L6696164@freefriends.org> Chet Ramey via TUHS wrote: > It also says "The standard utilities that have such restrictions always > specify "text files" in their STDIN or INPUT FILES sections," so you can't > avoid it. > > awk is one such utility (sh is not). This is an application requirement, so > awk is required to add a newline at the end of a file that does not have > one. No awk I know of actually does that. Instead, they all treat end of file as terminating the current input record: $ for awk in gawk nawk mawk mksawk "busybox awk" goawk > do echo == $awk > echo -n ab:cd | $awk -v RS=: '{ print ">" $0 "<" }' > done == gawk >ab< >cd< == nawk >ab< >cd< == mawk >ab< >cd< == mksawk >ab< >cd< == busybox awk >ab< >cd< == goawk >ab< >cd< If a newline were being added, the output would have been: >ab< >cd < Nor is this something that I'm about to change, or suggest to any other awk author to change. My two cents, Arnold From tuhs at tuhs.org Fri Mar 28 22:25:00 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Fri, 28 Mar 2025 08:25:00 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: <202503280722.52S7M4L6696164@freefriends.org> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <202503280722.52S7M4L6696164@freefriends.org> Message-ID: <8e480480-bd35-461d-afab-d265775051b8@case.edu> On 3/28/25 3:22 AM, arnold at skeeve.com wrote: > Chet Ramey via TUHS wrote: > >> It also says "The standard utilities that have such restrictions always >> specify "text files" in their STDIN or INPUT FILES sections," so you can't >> avoid it. >> >> awk is one such utility (sh is not). This is an application requirement, so >> awk is required to add a newline at the end of a file that does not have >> one. > > No awk I know of actually does that. Instead, they all treat end of file > as terminating the current input record: Sorry, I should have been more clear. This requirement is for scripts: "The requirement that awk add a trailing to the program argument text is to simplify the grammar, making it match a text file in form." which I imagine all awk implementations do. I think awk does the reasonable thing with an input file that doesn't end in a newline. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From johnl at taugh.com Sat Mar 29 03:15:32 2025 From: johnl at taugh.com (John Levine) Date: 28 Mar 2025 13:15:32 -0400 Subject: [TUHS] Marc Rochkind's article on SCCS has been published In-Reply-To: References: Message-ID: <20250328171532.AEC57C18FC83@ary.qy> It appears that Marc Rochkind said: >-=-=-=-=-=- > >Thanks! > >I've been trying to find out if there's actually a paper copy of >Transactions on Software Engineering, but have been unable to. Perhaps they >went to all digital (PDF) a while ago? Does anyone know? On the IEEE web site the subscription options are electronic only. R's, John From arnold at skeeve.com Sun Mar 30 19:13:03 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Sun, 30 Mar 2025 03:13:03 -0600 Subject: [TUHS] non-Posix input files In-Reply-To: <8e480480-bd35-461d-afab-d265775051b8@case.edu> References: <4d8f49e7-d42e-45e4-9355-d59070985293@case.edu> <8997c7ad-2f46-4d25-bee5-5b35d616a17d@case.edu> <202503280722.52S7M4L6696164@freefriends.org> <8e480480-bd35-461d-afab-d265775051b8@case.edu> Message-ID: <202503300913.52U9D38X906495@freefriends.org> Chet Ramey wrote: > Sorry, I should have been more clear. This requirement is for scripts: > > "The requirement that awk add a trailing to the program argument > text is to simplify the grammar, making it match a text file in form." > > which I imagine all awk implementations do. Ah. Indeed, gawk does this. I don't remember what goes in The One True Awk. Thanks, Arnold From douglas.mcilroy at dartmouth.edu Sun Mar 30 22:48:07 2025 From: douglas.mcilroy at dartmouth.edu (Douglas McIlroy) Date: Sun, 30 Mar 2025 08:48:07 -0400 Subject: [TUHS] non-Posix input files Message-ID: > "The requirement that awk add a trailing to the program argument > text is to simplify the grammar, making it match a text file in form." This should no more be a *requirement* for awk than globbing should have been a requirement for MS-DOS apps. A widespread principle deserves a widespread answer. If it is a requirement on awk, then for interoperability it should be made a requirement on all programs that handle text files, especially editors. The way to do that, of course, would be to redefine text file to allow a non-newline as the last character. Ugh. Not warning perpetuates travesties like "awk END{print NR}' " giving a different answer than "wc -l". I agree that awk does the kind thing by supplying the final newline. But it should recognize that this is non-standard behavior and warn in the interest of discouraging the proliferation of garbage. Postel's so-called "robustness principle" is in play here. "Be conservative in what you send, be liberal in what you accept" would better read, "Send conservatively; receive amply but grudgingly". Doug From arnold at skeeve.com Mon Mar 31 00:00:10 2025 From: arnold at skeeve.com (arnold at skeeve.com) Date: Sun, 30 Mar 2025 08:00:10 -0600 Subject: [TUHS] non-Posix input files In-Reply-To: References: Message-ID: <202503301400.52UE0AoW924488@freefriends.org> Doug, It turns out there wasn't enough context and we both got confused. The "requirement" is that awk '{ print }' ... look to the lexer / grammer as if it had been "{ print }\n". There is no requirement that input "text" files behave as if a final newline is always there. Thanks, Arnold Douglas McIlroy wrote: > > "The requirement that awk add a trailing to the program argument > > text is to simplify the grammar, making it match a text file in form." > > This should no more be a *requirement* for awk than globbing should have > been a requirement for MS-DOS apps. A widespread principle deserves a > widespread answer. If it is a requirement on awk, then for interoperability > it should be made a requirement on all programs that handle text files, > especially editors. > > The way to do that, of course, would be to redefine text file to allow a > non-newline as the last character. Ugh. > > Not warning perpetuates travesties like "awk END{print NR}' " giving a different > answer than "wc -l". > > I agree that awk does the kind thing by supplying the final newline. But > it should recognize that this is non-standard behavior and warn in the > interest of discouraging the proliferation of garbage. > > Postel's so-called "robustness principle" is in play here. "Be conservative > in what you send, be liberal in what you accept" would better read, > "Send conservatively; receive amply but grudgingly". > > Doug From johnl at taugh.com Mon Mar 31 00:23:21 2025 From: johnl at taugh.com (John Levine) Date: 30 Mar 2025 10:23:21 -0400 Subject: [TUHS] the case of the missing new line, Re: non-Posix input files In-Reply-To: References: Message-ID: <20250330142322.58A72C1C7739@ary.qy> It appears that Douglas McIlroy said: >Postel's so-called "robustness principle" is in play here. "Be conservative >in what you send, be liberal in what you accept" would better read, >"Send conservatively; receive amply but grudgingly". I'm reasonably sure he meant it more narrowly than that. "Be liberal" only applied when the spec was ambiguous, as was often the case in the early Internet when new corner cases popped up every day. When the spec clearly said to do something he had no problem rejecting inputs that were wrong. I haven't been writing code to handle byte stream text files as long as you have but there's an unfixable problem since Unix lets you put a stream of bytes not ending with \n into a file and then try to treat it as text. Whatever you do will be wrong for some uses. Perhaps we can segue to the argument about semicolons in Algol and Pascal vs. semicolons in PL/I and C which is more or less the same issue. R's, John PS: If only the Unix file system had records with two-byte length prefixes, or better fixed length 80 byte records, you know, like real computers do. From tuhs at tuhs.org Mon Mar 31 01:35:49 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Sun, 30 Mar 2025 11:35:49 -0400 Subject: [TUHS] the case of the missing new line, Re: non-Posix input files In-Reply-To: <20250330142322.58A72C1C7739@ary.qy> References: <20250330142322.58A72C1C7739@ary.qy> Message-ID: On 3/30/25 10:23 AM, John Levine wrote: > PS: If only the Unix file system had records with two-byte length prefixes, or > better fixed length 80 byte records, you know, like real computers do. Luxury. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From steve at quintile.net Mon Mar 31 02:12:19 2025 From: steve at quintile.net (Steve Simon) Date: Sun, 30 Mar 2025 17:12:19 +0100 Subject: [TUHS] non-Posix input files Message-ID: <66D55858-C664-44DC-85A1-7618FBDB9C41@quintile.net> Re: newlines at the end of files. I hesitate to ask this in such exalted company, but isn’t it a question of whether the newline is (or should be) a line terminator, or a statement separator? -Steve From tuhs at tuhs.org Mon Mar 31 07:41:24 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Sun, 30 Mar 2025 17:41:24 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: <66D55858-C664-44DC-85A1-7618FBDB9C41@quintile.net> References: <66D55858-C664-44DC-85A1-7618FBDB9C41@quintile.net> Message-ID: <70d11a09-f3e7-4f8f-8bbb-96bb8948460e@case.edu> On 3/30/25 12:12 PM, Steve Simon wrote: > > Re: newlines at the end of files. > > I hesitate to ask this in such exalted company, but isn’t it a question of whether the newline is (or should be) a line terminator, or a statement separator? I think that's part of the difference between input files (e.g., to `sort') and script files (the part of the `awk' specification I quoted earlier) and the distinction that POSIX makes between them, where applicable. -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: From tuhs at tuhs.org Mon Mar 31 07:45:27 2025 From: tuhs at tuhs.org (Chet Ramey via TUHS) Date: Sun, 30 Mar 2025 17:45:27 -0400 Subject: [TUHS] non-Posix input files In-Reply-To: <202503301400.52UE0AoW924488@freefriends.org> References: <202503301400.52UE0AoW924488@freefriends.org> Message-ID: <67268936-777c-48ba-86fc-21b942cf4c39@case.edu> On 3/30/25 10:00 AM, arnold at skeeve.com wrote: > Doug, > > It turns out there wasn't enough context and we both got > confused. The "requirement" is that > > awk '{ print }' ... > > look to the lexer / grammer as if it had been "{ print }\n". > > There is no requirement that input "text" files behave as if a > final newline is always there. There's a similar situation with `sh -c', since sh can treat the end of the string as a command terminator. The rationale notes it: "Earlier versions of this standard required that input files to the shell be text files except that line lengths were unlimited. However, that was overly restrictive in relation to the fact that shells can parse a script without a trailing newline[.]" Chet -- ``The lyf so short, the craft so long to lerne.'' - Chaucer ``Ars longa, vita brevis'' - Hippocrates Chet Ramey, UTech, CWRU chet at case.edu http://tiswww.cwru.edu/~chet/ -------------- next part -------------- A non-text attachment was scrubbed... Name: OpenPGP_signature.asc Type: application/pgp-signature Size: 203 bytes Desc: OpenPGP digital signature URL: