Dynamic linking madness: solving a bug in go-nvml

Sat, 15 Feb 2025 00:00:00 +0000

I work on open source observability software, primarily the Google Cloud Ops Agent, OpenTelemetry Collector, and Fluent Bit.
Over the past few years, I have gained an affinity for the kind of deep issues that send me as far into the weeds as I can get. In this post I’m going to go over one of those issues, partly to self-document everything I learned but also because I think it was an interesting journey worth writing down.

The Issue: go-nvml crashes our OpenTelemetry Collector

One of the features of the Ops Agent is GPU Monitoring; if you install the Ops Agent on a GCE VM with a GPU, you will automatically get metrics for it through the NVIDIA Management Library (NVML), and optionally through DCGM. To achieve this, we built specific instrumentation using the Go bindings for NVML and for DCGM.

When attempting to upgrade our build of the Collector to Go 1.21, we learned that the Collector would crash on startup if a GPU was present on the machine. It produced the kind of panic you don’t usually see in a Go program:

SIGSEGV: segmentation violation
PC=0x0 m=0 sigcode=1
signal arrived during cgo execution

Seeing PC=0x0 was very surprising to me. I had no idea how this sort of thing could occur in a Go program, even with CGO. Even more strange was that this crash was only happening on certain systems. How could something like a segfault be system dependent?
I was absolutely hooked. I would not rest until I understood why this could possibly be happening.

You can read the original issue in go-nvml and the issue I opened in golang/go to see the real discussions, or read on for my direct retelling.

Intro to dynamic libraries

This is information that I feel is important to understand the underlying issue. If you are already familiar with how dynamic libraries are loaded, you can skip to How go-nvml works.

Dynamic vs Static Linking

In C and adjacent languages, there are two ways to link a library to your application: static, and dynamic. Static linking is pretty straightforward; the library code is included at compile-time, and when the library is compiled into an object, it is then linked directly into the resulting binary. When the compiled program is run and something from the library is referenced, the implementation is already present within the binary. With dynamic linking, rather than the libraries being built directly into the binary, the libraries are simply referenced by the application to then be loaded at runtime. These will be .so on Linux or .dll on Windows. When the application is run, the operating system receives instructions to look for the libraries on the system, and if they are found they are loaded for the program to use, or if not found the program fails to start.

Static linking sure does sound great, right? There’s not much to think about there, the code is just included in the binary rather than needing to worry about having specific dynamic libraries on the system. Why wouldn’t you always do that? Golang agrees with you; all binaries built with pure Go are completely statically linked. This is actually a selling point of the language, and as an avid user of it I can feel the benefits. It is so nice to build a giant Go program, and just have one nice clean binary at the end with everything the binary needs. As someone working on a tool written in Go, I love that building and distributing it is so dead simple because it’s one statically linked binary. No separate instructions that certain libraries have to be apt installed onto the system, or being forced to distribute a container image for the tool to be usable.

Dynamic linking does have a purpose though, especially when writing lower level applications. One of the most popular use cases is C runtime libraries, an implementation of which is available on any Linux distribution, or can be installed on Windows through the Visual C++ Redistributable (something I’m sure many gamers have installed and not really known why). C runtimes can be statically linked in most compilers, however it often doesn’t make much sense to statically link something that is available on almost any system the application will run on. One of the biggest reasons is binary size. I’ve seen people online be quite confused at the size of a simple Go Hello World program exceeding a megabyte (at least at the time), but the reason for this is that Go does indeed statically link its runtime into the binary, which balloons its size.

Large binaries with lots of statically linked libraries have other complications as well, such as the amount of memory the program takes to run. I’d like to write a separate blog post about this at some point, but in short, large statically linked binaries can take more memory to run because loading the binary instructions and data in the first place takes up more space in RAM. The difference with dynamically loaded libraries is that the memory the library takes up can be shared by any other processes using the library. So if we just take dynamically linking libc as an example, there are probably tons of other applications on the system also dynamically loading libc and all sharing that memory in RAM. If all those same binaries had statically linked libc, then they would each have a private copy of libc, with all the space in memory that takes up, and would be unable to share it with any other processes on the system.

Dynamic Loading

The other way to interact with dynamic libraries is by loading them explicitly. With dynamic linking, the required libraries are built into the binary for the system to discover when the program is loaded. However, sometimes the exact library to be used can’t be known at compile time. There may be multiple versions of the library that the program is built to work with, and there needs to be some logic done at runtime to determine exactly which library is loaded. This is common with versioned APIs, where there may be v2 versions of functions present in dynamic libraries (rather than just reimplementing the functions so that backwards compatibility can be maintained, which is really important for dynamic libraries).
So the alternative method is loading the libraries at runtime using dlopen on Linux, or LoadLibrary on Windows. This gives you a handle to the library loaded into program memory, and to find symbols in it you can look them up in the loaded library using dlsym on Linux or GetProcAddress on Windows.
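To make the dlopen and dlsym flow concrete, here is a minimal sketch in C. The helper name call_unary_from_library is something I made up for illustration, and using libm's cos as the target is just an assumption to keep the example runnable on a typical glibc system.

```c
#include <dlfcn.h>
#include <stddef.h>

/* Signature we expect the looked-up symbol to have, e.g. cos() from libm. */
typedef double (*unary_math_fn)(double);

/*
 * Load `library` at runtime, look up `symbol`, and call it with `x`.
 * Returns 0 and stores the result in *out on success, -1 on failure.
 * (Hypothetical helper, for illustration only.)
 */
int call_unary_from_library(const char *library, const char *symbol,
                            double x, double *out) {
    /* RTLD_LAZY defers resolving the library's own function references. */
    void *handle = dlopen(library, RTLD_LAZY);
    if (handle == NULL) {
        return -1; /* dlerror() would describe what went wrong */
    }

    /* dlsym returns the address of the symbol, or NULL if it is absent. */
    void *address = dlsym(handle, symbol);
    if (address == NULL) {
        dlclose(handle);
        return -1;
    }

    /* POSIX allows converting the dlsym result to a function pointer. */
    unary_math_fn fn = (unary_math_fn)address;
    *out = fn(x);

    dlclose(handle);
    return 0;
}
```

Calling it with "libm.so.6", "cos" and 0.0 should store the cosine of zero in the output parameter. On older glibc versions you may need to link with -ldl for this to build.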

Exporting Dynamic Symbols (Linux ELF binaries)

We have now exceeded my knowledge of how this might work in Windows, so this section is specific to ELF binaries on Linux.

What typically happens in the linking step is that the linker tracks all external references to dynamic symbols in two sections of the binary called the PLT (Procedure Linkage Table) and the GOT (Global Offset Table). The PLT holds a small stub for each dynamic function used, while the GOT holds the actual addresses of dynamic symbols once they are known. When code uses a dynamic symbol, the compiler references the PLT entry for that symbol. At the linking stage, the linker will add those known symbols to the GOT. At runtime, when a PLT entry is called, it jumps to the address stored in the matching GOT entry; if the symbol has not been resolved yet, control passes to the dynamic linker, which resolves it first.

Let’s see this in action with a very simple C program:

#include <stdio.h>

int main() {
    printf("hi\n");
    return 0;
}

I’ll compile the binary with gcc and immediately disassemble it:

$ make
gcc -o hello -g -Wall main.c
$ objdump -d hello > hello.s

Let’s navigate the dump to the main subroutine:

0000000000001149 <main>:
    1149:	f3 0f 1e fa          	endbr64
    114d:	55                   	push   %rbp
    114e:	48 89 e5             	mov    %rsp,%rbp
    1151:	48 8d 3d ac 0e 00 00 	lea    0xeac(%rip),%rdi        # 2004 <_IO_stdin_used+0x4>
    1158:	e8 f3 fe ff ff       	call   1050 <puts@plt>
    115d:	b8 00 00 00 00       	mov    $0x0,%eax
    1162:	5d                   	pop    %rbp
    1163:	c3                   	ret
    1164:	66 2e 0f 1f 84 00 00 	cs nopw 0x0(%rax,%rax,1)
    116b:	00 00 00 
    116e:	66 90                	xchg   %ax,%ax

What we care about here is instruction 1158, with the call to puts@plt. This is a reference to a symbol puts in the PLT, which is a result of us calling printf from stdio.h in our program.

In the dump we can also analyze the disassembly of the plt:

Disassembly of section .plt:

0000000000001020 <.plt>:
    1020:	ff 35 9a 2f 00 00    	push   0x2f9a(%rip)        # 3fc0 <_GLOBAL_OFFSET_TABLE_+0x8>
    1026:	ff 25 9c 2f 00 00    	jmp    *0x2f9c(%rip)        # 3fc8 <_GLOBAL_OFFSET_TABLE_+0x10>
    102c:	0f 1f 40 00          	nopl   0x0(%rax)
    1030:	f3 0f 1e fa          	endbr64
    1034:	68 00 00 00 00       	push   $0x0
    1039:	e9 e2 ff ff ff       	jmp    1020 <_init+0x20>
    103e:	66 90                	xchg   %ax,%ax

Disassembly of section .plt.got:

0000000000001040 <__cxa_finalize@plt>:
    1040:	f3 0f 1e fa          	endbr64
    1044:	ff 25 ae 2f 00 00    	jmp    *0x2fae(%rip)        # 3ff8 <__cxa_finalize@GLIBC_2.2.5>
    104a:	66 0f 1f 44 00 00    	nopw   0x0(%rax,%rax,1)

Disassembly of section .plt.sec:

0000000000001050 <puts@plt>:
    1050:	f3 0f 1e fa          	endbr64
    1054:	ff 25 76 2f 00 00    	jmp    *0x2f76(%rip)        # 3fd0 <puts@GLIBC_2.2.5>
    105a:	66 0f 1f 44 00 00    	nopw   0x0(%rax,%rax,1)

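We can see that puts@plt ends up doing an indirect jump through the address stored at 0x3fd0 (0x2f76 past the end of the jmp instruction). That location is the GOT slot for puts, which holds the address of that symbol from GLIBC_2.2.5 once it has been resolved.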
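If you want to double check the addresses objdump prints in its trailing comments, the arithmetic is simple: a %rip-relative operand is measured from the end of the current instruction. Here is a small sketch of that calculation; the function name rip_relative_address is my own and not part of any tool.

```c
#include <stdint.h>

/*
 * Compute the absolute address referenced by a %rip-relative operand.
 * insn_addr: address of the instruction
 * insn_len:  length of the instruction in bytes
 * disp:      signed 32-bit displacement encoded in the instruction
 * (Hypothetical helper, for illustration only.)
 */
uint64_t rip_relative_address(uint64_t insn_addr, uint64_t insn_len,
                              int32_t disp) {
    /* %rip points at the next instruction while this one executes. */
    uint64_t next_insn = insn_addr + insn_len;
    /* Sign-extend the displacement before adding it. */
    return next_insn + (uint64_t)(int64_t)disp;
}
```

For the jmp at 0x1054 (6 bytes long, displacement 0x2f76) this gives 0x3fd0, and for the lea at 0x1151 in main (7 bytes long, displacement 0xeac) it gives 0x2004, matching the comments in the dump.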

All of this will be important when we get to the bug itself, so I hope you stayed awake!

How go-nvml works

The Go NVML bindings are an interesting challenge. NVML is a closed source library, and the intended usage is to link to the shared object on the system using a public header. So the way the Go NVML bindings work is as follows:

Provide a copy of the NVML header
Using a 3rd party tool called c-for-go generate a set of Go bindings
Wrap the Go bindings in a light API layer for user friendliness

The function that was segfaulting was actually the first function, nvmlInit. So let’s look at the process of loading this function:

The library libnvidia-ml.so.1 is loaded using dlopen with the flags RTLD_LAZY | RTLD_GLOBAL.
Much of the API is versioned in the library, so each of the versioned APIs is searched for in the loaded library using dlsym. If the v2 version of a symbol is present, then the bindings are told to use the v2 version of the symbol. In our case, we are using an NVML library that’s new enough to have nvmlInit_v2, so we will end up using that symbol.
Each of these symbols is wrapped with an exported Go function, that loads the library and checks for errors before calling into the generated bindings. So we would call nvml.Init() in our Go code.
This would lead to the generated bindings, which are what actually calls into CGO using import "C" and calls C.nvmlInit_v2().
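The versioned lookup described in the steps above can be sketched in C roughly like this. To be clear, this is not go-nvml's actual code; resolve_versioned is a name I invented, and the "_v2" suffix handling is a simplification of what the bindings do.

```c
#include <dlfcn.h>
#include <stdio.h>

/*
 * Look up `base_name` in an already opened library, preferring the
 * "<base_name>_v2" variant when the library exports it.
 * Returns NULL if neither name is present.
 * (Hypothetical helper, for illustration only.)
 */
void *resolve_versioned(void *handle, const char *base_name) {
    char versioned[256];
    int written = snprintf(versioned, sizeof versioned, "%s_v2", base_name);

    /* Only try the versioned name if it fit in the buffer. */
    if (written > 0 && (size_t)written < sizeof versioned) {
        void *address = dlsym(handle, versioned);
        if (address != NULL) {
            return address;
        }
    }

    /* Fall back to the unversioned symbol. */
    return dlsym(handle, base_name);
}
```

With a handle obtained from dlopen("libnvidia-ml.so.1", RTLD_LAZY | RTLD_GLOBAL), calling resolve_versioned(handle, "nvmlInit") would return nvmlInit_v2 on a library new enough to export it, and the older symbol otherwise.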

The Bug

A considerable amount of time has passed since this investigation took place, so I am writing with a ton of hindsight here. This explanation will obscure a ton of straw-grasping, which you can look through in the Go GitHub issue I opened. For the sake of this post though, I’m going to skip to the part where it all came together and the issue and solution became clear.

Ignoring the deep inner workings of how the NVML Go bindings work, I will focus on the most important core of it. This project generates C bindings based on an input header file. This header file represents the accessible API for libnvidia-ml.so.1, a proprietary binary that is expected to be installed on the user’s machine and loaded at runtime. It is not provided as part of the binding package, and will not be linked as a part of the build. To deal with this, the linker flag --unresolved-symbols=ignore-in-object-files is passed to the linker as part of the bindings. This flag makes it so the symbols from nvml.h, which are not going to be resolved in the build with the shared object missing, will be ignored by the linker and not considered an error.
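Since those symbols are left unresolved at build time, they only become usable once something at runtime makes them visible to the process. As a rough way to observe that from C, the sketch below asks the dynamic loader whether a name is currently resolvable in the process's global scope. The helper symbol_in_global_scope is my own invention and not something go-nvml does; get42 is just a placeholder for a symbol that nothing provides.

```c
#include <dlfcn.h>
#include <stddef.h>

/*
 * Returns 1 if `name` can currently be resolved in the global scope of
 * the running process, 0 otherwise.
 * (Hypothetical helper, for illustration only.)
 */
int symbol_in_global_scope(const char *name) {
    /*
     * dlopen(NULL, ...) returns a handle for the main program. Looking a
     * symbol up through it searches the executable, the libraries loaded
     * at startup, and anything later opened with RTLD_GLOBAL.
     */
    void *self = dlopen(NULL, RTLD_LAZY);
    if (self == NULL) {
        return 0;
    }

    void *address = dlsym(self, name);
    dlclose(self);
    return address != NULL;
}
```

In a normal C program linked against libc, a name like puts is found, while a name like get42 that no loaded library exports is not. Loading a library that exports the name with RTLD_GLOBAL is what would change the second answer.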

Our initial knowledge was that the bug occurred under the following circumstances:

Using Go 1.21
Building on Ubuntu Jammy or newer, but not on earlier distros like Debian 10 Buster

At this point in the investigation a lot of these concepts were still somewhat new to me. Still, given that the issue involved a dynamic library loaded through CGO, I had a feeling it had something to do with linking. I suspected the version of ld on the system was the culprit, and that something in the CGO layer of Go had changed in a way that conflicted with a newer version of ld. It took me a non-trivial amount of time to realize why, but this ended up mostly correct.

Standalone Repro

In order to a) determine whether this was go-nvml specific or something inherent to Go, and b) to not require me to have NVIDIA libraries installed while developing, I created a standalone reproduction. This confirmed that setting up a small CGO program under the same circumstances (providing a header but no object and passing --unresolved-symbols=ignore-in-object-files to ld) panicked in the exact same way. We can work with this from here on out.

Comparing Go 1.20 to 1.21

Using the reproduction, I will build 2 binaries, one with Go 1.20 and one with Go 1.21.

The repro program includes a header that defines a function get42 and makes a call to it. This symbol should be unresolved in the build, and should show up as such in our binary. If we use nm on the Go 1.20 binary, we can find our get42 existing as expected as an unresolved symbol:

$ nm cgo_dl_repro_go120 | grep get42
0000000000483760 T _cgo_49665a31f432_Cfunc_get42
                 U get42
0000000000483580 t main._Cfunc_get42.abi0
000000000051b1c8 d main._cgo_49665a31f432_Cfunc_get42

However, checking out the Go 1.21 binary shows an important difference, which is that this symbol is missing!

$ nm cgo_dl_repro_go121 | grep get42
000000000047ce70 T _cgo_49665a31f432_Cfunc_get42
000000000047cca0 t main._Cfunc_get42.abi0
000000000051b1a8 d main._cgo_49665a31f432_Cfunc_get42

The only get42 symbols are the CGO calls we make in the Go code and the symbol from the C code that CGO generates.

I did not fully grasp what I was looking at when I found this, but this turned out to be the important difference. The get42 unresolved symbol being missing actually meant that the get42 symbol did not have an entry in the PLT. This results in Go generating assembly for this program that looks like this (disassembled by go tool objdump):

TEXT _cgo_49665a31f432_Cfunc_get42(SB) 
  :0			0x47ce70		4154			PUSHQ R12			
  :0			0x47ce72		55			PUSHQ BP			
  :0			0x47ce73		53			PUSHQ BX			
  :0			0x47ce74		4889fb			MOVQ DI, BX			
  :0			0x47ce77		e88416feff		CALL _cgo_topofstack(SB)	
  :0			0x47ce7c		4989c4			MOVQ AX, R12			
  :0			0x47ce7f		31c0			XORL AX, AX			
  :0			0x47ce81		e87a31b8ff		CALL 0x0 <-- EVIL!!!!	
  :0			0x47ce86		89c5			MOVL AX, BP			
  :0			0x47ce88		e87316feff		CALL _cgo_topofstack(SB)	
  :0			0x47ce8d		4c29e0			SUBQ R12, AX			
  :0			0x47ce90		892c03			MOVL BP, 0(BX)(AX*1)		
  :0			0x47ce93		5b			POPQ BX				
  :0			0x47ce94		5d			POPQ BP				
  :0			0x47ce95		415c			POPQ R12			
  :0			0x47ce97		c3			RET

And a reminder of what that panic looks like:

SIGSEGV: segmentation violation
PC=0x0 m=0 sigcode=1
signal arrived during cgo execution

That explains how we’re getting program counter 0x0!
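You can confirm the 0x0 target by hand from the raw bytes in the listing. A near call on x86-64 is the opcode 0xE8 followed by a little-endian signed 32-bit displacement, measured from the end of the 5-byte instruction. Here is a small sketch of that decoding; decode_call_rel32 is a name I made up for this post.

```c
#include <stdint.h>

/*
 * Decode a 5-byte x86-64 near call (opcode 0xE8 + little-endian rel32)
 * located at insn_addr. Stores the call target in *target and returns 0,
 * or returns -1 if the bytes are not an E8 call.
 * (Hypothetical helper, for illustration only.)
 */
int decode_call_rel32(uint64_t insn_addr, const uint8_t bytes[5],
                      uint64_t *target) {
    if (bytes[0] != 0xE8) {
        return -1;
    }

    /* Assemble the little-endian displacement. */
    uint32_t raw = (uint32_t)bytes[1]
                 | ((uint32_t)bytes[2] << 8)
                 | ((uint32_t)bytes[3] << 16)
                 | ((uint32_t)bytes[4] << 24);
    int32_t disp = (int32_t)raw;

    /* The displacement is relative to the next instruction. */
    *target = insn_addr + 5 + (uint64_t)(int64_t)disp;
    return 0;
}
```

Feeding it the bytes e8 7a 31 b8 ff at address 0x47ce81 yields a target of exactly 0, which is consistent with the ignored symbol having been given address zero. For comparison, the bytes e8 f3 fe ff ff at 0x1158 from the earlier C example decode to 0x1050, the puts@plt stub.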

The Solution

While I spent a considerable amount of time experimenting and looking through go tool linker and cgo source code to try and understand what was going on, and I did learn a lot, I ended up finding the problem with a good old fashioned git bisect. I ended up at commit 1f29f39.
The message of that commit: cmd/link: don't export all symbols for ELF external linking
The problematic code change was from this:

// Force global symbols to be exported for dlopen, etc.
if ctxt.IsELF {
	argv = append(argv, "-rdynamic")
}

To this:

// Force global symbols to be exported for dlopen, etc.
if ctxt.IsELF {
	if ctxt.DynlinkingGo() || ctxt.BuildMode == BuildModeCShared || !linkerFlagSupported(ctxt.Arch, argv[0], altLinker, "-Wl,--export-dynamic-symbol=main") {
		argv = append(argv, "-rdynamic")
	} else {
		ctxt.loader.ForAllCgoExportDynamic(func(s loader.Sym) {
			argv = append(argv, "-Wl,--export-dynamic-symbol="+ctxt.loader.SymExtname(s))
		})
	}
}

What does this mean? The code used to always pass the -rdynamic flag to gcc, which passes --export-dynamic to ld under the hood. After the change, -rdynamic is only passed in a few specific build modes or when the linker does not support the --export-dynamic-symbol flag; otherwise, only the symbols explicitly marked for dynamic export through cgo are exported, one flag per symbol. The justification for this is in this issue (TL;DR it’s because this is unnecessary in most cases and thus wastes space on a majority of binaries). While it’s hard to know exactly when the --export-dynamic-symbol flag was added to ld, its availability seems like the only plausible reason that this issue only occurs with sufficiently new versions of ld.

Since -rdynamic is no longer always passed in the CGO build process, the fix I landed on was to modify the binding generation in go-nvml to always pass the --export-dynamic linker flag. This doesn’t break anything if the -rdynamic flag is also passed, but ensures that the required ld flag is still passed with newer versions of Go and ld.

Conclusion

This was a very hard issue to figure out, and was around a week’s worth of effort. The solution was 16 characters. This is why it’s hard to measure coding productivity by raw output! :)

I’m still glad I went through all of it, and glad I went through the process of re-documenting it by writing up this post. Hopefully you got some enjoyment out of my adventure!

Software Industry vs Software Education

Fri, 08 Apr 2022 00:00:00 +0000

I’ve decided to put pen-to-paper (keyboard-to-markdown?) on a rant I’ve given to friends and colleagues numerous times since my University career ended. I want to talk about what I like to jokingly refer to as “the ticket to the industry”: the Bachelor’s Degree.
If you pull up a software dev job posting and check the requirements, there is a ~99.999% chance that one of those requirements is a “Bachelor’s Degree in Computer Science or a related field”. If you’re lucky, it will add “or equivalent experience”.

My Bachelor’s Degree

During my undergrad, I hated pretty much everything about school. I knew I loved Computer Science, and I was utterly committed to completing my degree, but I barely made it. The system really felt like it was a carefully designed torture chamber made just for me to pay thousands of dollars to suffer in.
I loved so many of the concepts and subjects I was learning in Computer Science and Math. However, particularly for Math, the shift in priorities coming to University was a shell shock. The goal of these classes didn’t feel like learning anymore; they felt like a game to achieve the best mark. It was a game I sucked at. I don’t think there are any words to effectively describe how bad I was at exams. I never properly learned how to cope with my intense distractibility and struggle to focus throughout school, and my memory for concepts I didn’t deeply understand was incredibly fragile. My success in any courses, even Computer Science ones, hinged almost entirely on what percentage of the mark was derived from exams. Even worse were the courses that required you to pass the final exam to pass the course (this is the worst of the “torture chamber designed for me”). I failed 2 classes, both through final exams alone: Object-Oriented Programming (which I had done extensively even then, but choked writing Java on paper), and Probability (which terminated my then-burgeoning interest in Data Science).

I wouldn’t think so much about the good old torture chamber now, considering I’m years removed from receiving my degree, but I am constantly upset on reminder of what university could have been for me. My passion has shifted from just Software Development to deep Computer Science. My wife loves to poke fun at me for reading basically only CS textbooks, and my spare time is often spent learning about increasingly deep computer science concepts. While I was in school, all I could feel was the stress-rage-hybrid of looming exams, and the intense desire to be free and get what I really wanted; a job as a software developer. I’m so grateful to have achieved that goal, but I still can’t help but think about how different my life would have been if I hadn’t had to go through something I was so bad at to get there.

Believe it or not, this article is not just for me to complain how much I hated university and exams (although it was cathartic to write, and I’m leaving it in). Despite how much I hated it, I really can’t blame University for being… University. The validity of the post-secondary system isn’t really the dialogue I’m going for (at least today). The real point here is that University was simply not for me.

University wasn’t for me, but what choice did I have?

I was led to believe that University was the only way to get into the software industry. When it came time to decide my future, there didn’t seem to be an alternative. Even going for college seemed like a death knell for your chances to break into the industry, and I couldn’t even fathom trying to get in as a self-taught developer. These were obviously not true then, and are even less true now as the narrative around alternate paths to the industry has improved significantly. At the time however, I hadn’t connected to any tech communities even online, and was far too sheepish to reach out for mentorship. My pipeline to the industry was driven largely by how my high school directed me and my vapid attempts to make video games on my own time. I could only consume the information and assumptions that were most easily available to me. It sure didn’t seem like bad information at the time either; every job posting I checked required a Bachelor’s Degree. EVERY one. It sure seemed like my only course of action.

Disdain for Bachelor’s Degrees

I have been waxing lyrical about my own woe-is-me relationship with University, but I’m not alone. The glorified engagement farm widely known as tech twitter has been firing away the catchy tweets about how they got into the industry without a degree, or vaguely asking whether the twitter-verse thinks a degree is required to get a job as a developer. (I guess I shouldn’t be so cynical about it, they are generating infinitely more clicks than this blog with no SEO will).

I’ve spoken to many of my peers and colleagues in the community, and an (anecdotally) common sentiment is that they feel university did not prepare them adequately for the “real world” of software development. Common misgivings were the outdated technology used in courses, the heavy requirements for seemingly unrelated maths, and lacking guidance on realistic software industry skills (source control, software architecture, web development tooling).

It seems like so many people are on the same page about this in their own way: what is taught for a Bachelor’s Degree seems to be heavily at odds with the standard industry requirement for it.

Do universities need to “get with the times”?

You could get upset at post-secondary programs in general. Perhaps these programs need to teach more applicable, employable skills. Maybe they should be directing student learning toward more practical topics to increase their confidence to enter the industry. If these classes aren’t teaching students what they feel will be useful for their jobs, then what’s the point?

The counter to this is often to espouse the value of foundational knowledge. The things you learn in University may not be things most folks will do day to day in their careers, but these fundamental concepts are an effective way to become a well-rounded developer.
So which of my strawmen is right?
SIKE, they both are. Sort of.

The modern software industry

Software is a pretty young industry overall, however the prevailing goal of most software jobs has remained relatively constant: to create products that people use. While that goal hasn’t changed much, the tools available to accomplish that goal have changed drastically. The advancement of developer-focused tools and frameworks has led to a major shift in the kind of skills necessary to get started developing software. The tools at a developer’s disposal have become so sophisticated that they abstract numerous fundamental building blocks that previously required deep knowledge to use. Web application frameworks blur the line between servers and clients; Kubernetes has made distributed systems a game of learning the available tools; UI design suites have broken the barrier between vision and implementation. (Disclaimer: I know all of these are severe oversimplifications). The general theme of modern tools is to abstract difficult foundational concepts to flatten the barrier to entry; your success with these tools would no longer hang on how well you understand the complex technical concepts they abstract, and instead on how well you can learn the tool (which is usually a far faster process). I’d argue there are very few tools that have fully achieved that goal, but I can feel the paradigm shift. Over time, a new vacuum in the industry has formed entirely for talent with existing expertise in these specific modern tools.

Herein lies the crux of the problem; CS programs at post-secondary institutions produce well-rounded software developers who can leverage their foundational knowledge to numerous paths in software development, but their education may not have prepared them for the overwhelming number of jobs that require specific skills in industry tools. So many students simply wanted to start working in the industry, but the industry pushed them toward a seemingly false start.

Software Development as a Trade

I think there’s still a place for University. For these modern tools to exist as monolithic abstractions of complicated foundations, there need to be niche experts to build them. There are still a number of software jobs that would benefit greatly from folks with deeper academic knowledge. Still, a large number of jobs don’t seem to be after academics, but rather after software developers as practitioners of a trade. I think framing software development as a trade vs. an academic pursuit serves the needs of the modern industry pretty well. Software development as a trade is more like using the development of software as a means to an end to accomplish business goals. Practitioners of software development as a trade would be trained specifically in the relevant tooling, and their expertise would be catered to the needs of the industry. Software developers as academics, computer scientists if you will, would be the folks doing intense research and studying. They would be the experts building the bedrock of computing, and the tradespeople would be the experts bringing it to wider society.

What would this sort of separation gain us? For starters, there would be more paths for future developers to enter the industry. Rather than University seeming like the only path forward, perhaps there could be a shorter trade program, or something like a bootcamp that trains people explicitly to become practitioners; they would start with the basics as any software education should, but provide a more direct path to preparing specifically for jobs in the industry. Such a large number of developers endeavour only to build great things and use software development as their tool to do that; more focused trades programs would get them to that goal faster. It would also increase the rate of new talent joining the industry, and that new talent would arguably be more primed to onboard to the average company building products with popular tools. This would still leave a place for universities not only to continue teaching the important required topics of a Computer Science degree, but would even relieve pressure on them to conform to the needs of the industry. Developers with aspirations to learn specifically Computer Science can go into post-secondary, and those interested mainly in Software Development can pursue it as a trade.

I think we may sort of be heading in this direction already; I have never been to one, but bootcamps do seem to be similar in spirit to what I’m trying to describe. I think one of the limiting factors for alternate paths to the industry is the persistent dogma around Bachelor’s Degrees, and the usage of the degree as an arbitrary barrier for new folks to enter the industry. I think the biggest realistic step forward for the industry would be to not only acknowledge the validity of alternate paths, but also understand where they may be advantageous instead of simply settling for them.

If I’m being honest with myself, much of this is pie-in-the-sky optimism; any immediate shifts like this would require disjointed demographics and organizations with different values to somehow shift their priorities in sync. We do appear to be taking baby steps though; there is a rise in vocal self-taught programmer pride, and an increasing number of developers are finding their way into the field through bootcamps and online courses.

To be Fair and Balanced though, this idea would be unlikely to become a utopia. It would help a lot more people enter our industry in ways that suit their goals and learning style, and would allow companies to hire in a way that more directly suits their requirements. However, in our present environment of late-stage capitalism -

I SNUCK AN ANTI-CAPITALIST PREMISE INTO MY BLOG POST

GOTCHA! YOU SHOULD SEE YOUR FACE RIGHT NOW!

In our present environment of late-stage capitalism, we apparently cannot get enough of social and class hierarchies. A field like software development attracts a lot of pearl clutching and vapid gatekeeping. A very loud minority of people are desperate to fight over the definition of a “real developer”, feeling personally offended and protective of the title because some developers never needed to untangle hundred line C++ template error messages. This sort of desperation to attain and maintain pseudo-intellectual superiority over each other would absolutely be exacerbated by a publicly accepted difference between “software tradespeople” and “computer scientists”. In the worst case (and probably most likely) scenario, companies will absolutely eat that up. Whatever their public messaging might be, they would likely use this difference to create new pay hierarchies, and find a metric-assload of creative ways to keep tradespeople underlevelled and underpaid. I think there’s a very real chance it could devolve into a sort of class system among software developers, where university degrees arbitrarily earn smarmy confidence, and higher wages for similar work; arguably, this is even today’s status quo because some people suck.

Conclusion

I don’t know that there is a perfect solution to the problematic relationship between our industry and Bachelor’s Degrees. I’m certainly not one to suggest sticking to something just because it’s the way we’ve always done it, so I can’t help but ideate some perfect balance we may never truly achieve.
If you’ve clicked on this post there is a high chance we know each other and you’ve already heard me say all this, but if not: first of all, hi! Thanks for reading! If you are a new software developer, I hope you don’t feel as trapped as I did when I started, and that you are aware of the paths open to you. If you’re already in University, I hope this doesn’t somehow sour your perspective; I may have hated school, but I still consider it an incredibly valuable part of my life, and I hope it is the same for you. If you’re someone who does hiring in any capacity, I hope this post inspires you, in however minor a way, to critically consider what you look for and how you can adjust to keep yourself open to the talent that’s waiting to find you.
Whoever you are, I hope you took something away whether you agree or disagree. As always, I’m happy to discuss either way, because I don’t claim to be in any way an expert and I would love to hear your thoughts.

Self hosting with Caddy, gitea, hugo, bitwarden, and more!

Sat, 15 Jan 2022 00:00:00 +0000

I have always wanted to try self-hosting things that are clearly better done by a SaaS provider. That’s why I took a few hours, a big ol’ Ubuntu VPS, and a domain name to try and self-host a bunch of things I use every day! I might hate myself later, but I’m having fun for now. I decided to write a little bit about what I did to make everything work.

UFW (Uncomplicated Firewall)

If I had gone with a more fully-featured cloud hosting provider, such as DigitalOcean or Linode (not affiliated with either), I would have been able to configure my VPS’s firewall through a UI console. However, I had already purchased a really large server for cheaper with another provider. This meant I needed to set up my firewall right on the server myself. As a software developer, I am horrible at SysAdmin by nature; the idea of setting up critical iptables shook me to my very core. This was why I chose to configure my firewall with the easier to use ufw.

The setup I needed was as follows: deny all incoming traffic by default, allow all outgoing by default, then allow traffic on the ports I needed (namely SSH, HTTP, and HTTPS).
What scared me the most was potentially locking myself out of my server. Working directly with iptables put me at risk of this, as iptables rules operate at the kernel level. If I changed one wrong thing, I could completely lock myself out of my server. ufw didn’t have this problem, because it runs as a service; I could configure all of my ufw rules and only enable the service when I felt everything was ready. I did a dry-run on a temporary tiny VPS to make sure I wouldn’t lock myself out (sudo removed for brevity):

ufw default deny incoming
ufw default allow outgoing
ufw allow ssh
ufw allow http
ufw allow https
ufw allow 25565 # I have a Minecraft server running on this machine already!
ufw enable

After doing this, I disconnected and tried to ssh into the server again. It worked as expected, and now that I’d verified it on the test VPS, I ran the same commands on my main VPS with similar success.

Time to come clean though; this wasn’t the first thing I did. This was actually one of the last things I did (hence why I was extra scared of locking myself out). The main reason I did this was to make sure my server wouldn’t accept connections to http://:. This worked for some of my services, but not for one of them. It didn’t work because ufw by default cannot stop Docker from accepting connections directly to published ports. On install, Docker adds iptables rules that are evaluated before ufw’s. I tried a veritable cornucopia of bad solutions before finding this repo that provided the perfect solution for me.

With my server locked down (let’s pretend that’s the first thing I did, like it should have been) it was time to move on to the server I decided to use for reverse proxying.

Caddy

In my previous attempts at hosting things myself, I had fumbled through nginx reverse-proxy tutorials. While nginx is a great skill to learn, and an incredibly mature tool, I decided to take a different route this time and use Caddy. I acknowledge that nginx is great technology, but after using it on this server I am officially sold on Caddy.

The two reasons I love Caddy are its super easy configuration and its automatic HTTPS. The biggest challenges I had with hosting things myself in the past were partly my poor nginx configuration abilities, but largely that messing with certbot (an admittedly great and easy to use project) was a lot more work than I wanted to constantly manage for every single project that I wanted to host. HTTPS is a process that can be automated, and Caddy proves that. Now, I simply add a new site configuration block to my Caddyfile and I automatically have HTTPS for that site (provided the domain name I specified has DNS configured correctly, more on that later).

I installed Caddy on my system through the stable apt repo. I used it as a systemd service, and added configuration to the /etc/caddy/Caddyfile. When I mention “adding something to the Caddyfile” further down this article, I am referring to editing this file and running sudo systemctl restart caddy to reload the configuration.

Gitea

The first thing I wanted to set up was my own git server! I used a fantastic open-source project called Gitea that replicates a lot of GitHub’s features. It’s missing some of the more advanced GitHub features, but as a place to toss my personal project code it seemed perfect.

I installed Gitea through a user-maintained deb package. Honestly, if I were starting from the top, I probably would just install it through Docker, but this is working fine for now anyway. Installing this package created a gitea user for me, so I created the necessary directories from this Gitea tutorial and gave the gitea user access instead of the git user that these docs suggest manually creating. The rest of the steps from then on in the docs ended up working for me. So now I was able to run the gitea systemd service on port 3000. The next step was to set up the reverse proxy so I could get into my Gitea instance.

I have a domain (ragecage64.com) with Google Domains, however I don’t use anything specific to that system. All I had to do was add a DNS address (A) record for the git subdomain I wanted. It looked something like this:

Once this was set up, I added the following block to my Caddyfile:

git.ragecage64.com {
    reverse_proxy localhost:3000
}

After restarting caddy, I had https://git.ragecage64.com ready to go! There was a first-time setup screen that I forgot to take a screenshot of, but it is pretty self-explanatory. I spent a little bit of time selectively migrating the repos I wanted to keep to my new Gitea instance, and setting up my SSH key and new username. Really loving Gitea so far!

Get files from Gitea repos within the server

This was an important step for the next couple of things I’m going to talk about. Once things are pushed to my Gitea instance, I’m able to access those files on the server to perform whatever builds I need in order to run them.
To do this, figure out where your Gitea is storing its repos (for me it was in the default directory /var/lib/gitea/data/gitea-repositories/). In this folder you can find the bare repositories. To get the data from these bare repositories from anywhere on your server, you can clone the bare repository, i.e. git clone $GITEA_REPO_DIR/.git. Now you have a copy of the code on your server to do with whatever you please.

Bub the Discord Bot

This was probably the easiest thing to set up. My bot is written in Go, meaning all I need is a Discord Bot Token in my local env and to run the compiled program. First, I pushed the bot code to my Git. Then I pulled the bare repo on my server, ran the command in the Makefile, and ran the compiled binary. Pretty simple setup!

The challenge came when I needed to run multiple apps at once on my server. The more correct thing would probably be to create systemd services out of everything, but that’s hard. :)
The two things I needed to run and quickly get at logs for are my Minecraft server and Bub. I used multiple GNU screen sessions to accomplish this. I started named screen sessions like so:

screen -S minecraft

Once I created the screen session, I ran the server and detached from the session with Ctrl+A, D.
Then when I needed to reattach to the screen, I could use the command:

screen -xS minecraft

I did the same thing with my bot. Pretty good setup overall!

This Blog

I am now also hosting this blog on my server! This blog is a static site made with Hugo, which I highly recommend if you’re looking to make a blog. The great thing about this was that a static site is similarly easy to host through Caddy!
I started by installing hugo, cloning the bare repo (I had to --recurse-submodules because I installed a theme, important step), and running hugo. This built my site to the public folder (optionally, could output this public folder to a smarter place in the server). Next, I added the following block to my Caddyfile:

blog.ragecage64.com {
	root * /public
	file_server
}

I also added a DNS record similar to the one above.
Now I have the site you are currently on! To update my blog now I push to my repo, pull the server copy of the repo, and run hugo. A bit more work than GitHub Pages where I previously hosted this site, but every part of this is more work than it used to be and I’m still having fun!

BitWarden

The last thing I got working was my own BitWarden instance to share with my partner and family. To do this, I decided to run a docker container of the Rust implementation of Bitwarden. I created a docker-compose file for the container (which maybe wasn’t necessary because I’m just using SQLite anyway, but that makes it easier to add a real DB later) and ran it in the background with docker-compose up -d. I then created a DNS record and Caddy reverse proxy similar to Gitea above, and followed the instructions to connect BitWarden clients to my instance. When I first started the instance, I used the container environment variable SIGNUPS_ALLOWED=true. This allowed me and my partner to quickly sign up, before I restarted the container with this environment variable set to false. This means only the people I want to sign up for my instance can; it’s only on a SQLite database, it’s not exactly web scale!

Who knows what else!

Now I have an easy way to host any future projects on one server! It’s pretty exciting, and I don’t know what’s going up next, but next time I think of something exciting it’s fun to know I always have somewhere to put it!

colors and faker: a case study on the npm ecosystem

Mon, 10 Jan 2022 00:00:00 +0000

Foreword

For years I’ve listened to software engineers more experienced than myself poke fun at the left-pad incident. It usually comes up as a joking throwaway comment about keeping package-lock files in sync, or alongside the related xkcd comic (which seems to get more relevant the older it gets). It was technically just before my time as a professional developer (my less-than-stellar jQuery experimentation was safe from this at the time), so I would take it as a cautionary tale that taught us an important lesson about the software supply chain.
It also informed a lot of the learning I have done over the years about what it means for software to be open source, the nuances of open source software licensing, and the difference between freedom and beer. I’ve always been passionate about software that is at the very least source-available; the collaboration between so many talented and passionate people has always felt like something of a panacea to me (depending how rosy my glasses are that day).

This is all to say that reading about what happened with the npm packages colors and faker left me with a lot to say. I would have gone the lazy route and tweeted my thoughts to the void as usual, however I haven’t posted to this blog in checks notes 10 months! Some content creator I am. The shareholders will have my head!

So without further ado, I’d like to take as nuanced a look as I can at all the moving pieces of this fascinating case study.

What happened with `colors` and `faker`?

The headline is not clickbait enough to attract anyone who does not already know about this situation (other than my proofreading partner, hi dear!). However, for the purposes of this post, I’m going to pretend you have no idea what’s going on and summarize quickly so we can build some context.

colors is an npm package that enables the user to colour their console text in their command line applications. Command line applications may not be the first thing that comes to mind when you think of Node.js, but a vast majority of JavaScript dev tools have a command line interface and leverage this package to improve the appearance of their output.

faker (no link for this one; will explain shortly) is an npm package that will randomly generate data, however this data is believable; it falls into common data patterns like names, street addresses, movie quotes, etc. I am not exactly sure which was first, but this library was heavily inspired by counterparts in other languages such as Perl, Ruby, PHP, and Python.

These packages are authored and maintained by the same developer: Marak Squires (see his GitHub). These packages were used by thousands of Node.js applications, all published to and subsequently downloaded from the node package manager’s central repository. This large repository of packages is owned by npm, Inc. and GitHub, and is the source from which virtually every node application pulls at least some open source dependencies. colors and faker were both open source and published with the MIT License.

Last year, the author of these two packages decided that they were no longer interested in developing and maintaining the packages. They opened this issue, declaring that they would no longer be working on the package. Last week, they took this a step further: they intentionally introduced an infinite loop with spooky text in colors and, as we zoomers might say, yeeted faker from existence (not really, but I will expand on that later on). This affected thousands of Node.js applications, which means it affected a ton of developers and companies of all sizes. And I mean “all sizes”; one of the affected packages I am personally familiar with is Amazon’s aws-cdk, and this is just one of many widely used packages that were essentially bricked until the issue was resolved.

Now that we have a general idea of what happened, I’d like to add my interpretation of what it means to work with npm.

What does it mean to download an npm package?

One of the earliest lessons I learned when I first started using Linux is to not download and execute random scripts without reading them first and understanding the risk. They require that you make a conscious decision to trust the source of the script. This makes sense when you think about it; a bash script (especially with sudo permissions) has the power to do an incredible amount of damage to your system (or maybe just fork bomb you as an epic prank, anything goes). Usually, where possible, you were encouraged to install your software through your distribution’s central package repository. This large repository of packages, all built specifically for the distribution, is owned and maintained by a dedicated group of volunteers or employees who vet and approve each one. There are ways for independent users or organizations to host their own repositories of packages, and integrate with the distribution’s respective package managers. These require that you trust their source, similarly to downloading and executing bash scripts.

The reason for this tangent is to relate it back to npm installing a package. Installing a package through npm is a combination of these two flavours of installing software; it is a package manager similar to the ones commonly included in Linux distributions, however each package in its central repository is not vetted and managed by a group of volunteers or employees. When you download and execute a package from npm’s central repository, you are trusting the author of that package.

Now it’s obviously a bit extreme to directly equate installing an npm package to sudo executing a bash script. It’s a lot more nuanced than this since the most popular packages in the repository also have the most security experts’ eyes on them at all times. They may not be constantly approved by a central group of people, but in a perfect world issues are swiftly reported and dealt with by package maintainers and consumers of the package. npm also has a number of mechanisms to keep dependencies at a certain version until you trust that an upgrade is up to your standards.

What does it mean to publish an npm package?

Every JavaScript package on npm is open source by nature. JavaScript is an interpreted language, and no amount of obfuscation will truly hide the JavaScript being shipped when a package is published to the central repository. This code can be licensed under any open source license that suits the project’s needs. This license legally defines the way that the copyright holder approves the code to be used. Once the license (or lack of one) is defined, and the minimum setup requirements are present, you are free to publish whatever you would like. You could publish the next big JavaScript framework, a useful new CLI tool, nothing, whatever you’d like provided it is legal.

How we’ve forgotten this

Node.js and npm are tools that have come about as close to ubiquity as a very short list of technologies ever have. The number of new developers whose first step of their journey was/will be to run npm install is staggering. The largest companies in the world continue to rely on npm in varying capacities, and have contributed a large number of popular packages to its ecosystem. When such an apparent consensus of people are doing something, it’s easy to interpret some guarantee of safety. You wouldn’t jump off a bridge just because someone else did, but if 10 000 people jump off a particular bridge every day it’s gotta be safe, right?

One-way Trust

Let’s have a quick look at that MIT license again. The first line of this license states: “Permission is hereby granted, free of charge, to any person obtaining a copy of this software[…] to deal in the Software without restriction[.]”. Further down, the license states: “[sic, all caps] THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED[.]”.
I am not a lawyer, but my interpretation of this in plainer terms is that anyone is allowed to use this software as they see fit, but that the copyright holder provides no guarantee of anything that may come with that freedom. If it doesn’t work for some reason, doesn’t do what you want, or has some kind of major flaw, the copyright holder does not bear the responsibility. When some of the biggest packages on npm are maintained by such large, efficient, and dedicated teams, it is easy to forget that warranty of any kind is almost never included by default; that’s one of the exchanges that is made when the software does not cost anything.

How this relates to `colors` and `faker`

All of this is to contextualize my general takeaway from this situation; the author of colors and faker was within their rights to self-sabotage their packages. A quick disclaimer that I personally dislike what they decided to do, and I’ll shortly expand on that, but I want to explain why I think what they did is within their rights.
I think this goes right to the root of what open source is, distilled to its core: some person writes code, they put it somewhere for the public to see, the public can do with it whatever the license permits. To be reductionist for the sake of brevity (let’s laugh and pretend I have that), I think every single piece of open source code reduces down to this. Even if code is published by a Fortune 500 company who have great motivation to maintain that software until the end of time, even if software is so popular that volunteers will diligently shepherd it along the moral high ground to the heat death of the universe, open source software is still this at its core. When you use this published code, you have made the implicit decision to trust the person who published it. The only binding promise that the copyright holder has made in return is that the code exists and you can use it.

Marak wrote these packages and published them under the MIT license. They can update this code however they want. If they want to intentionally corrupt the package making it print LIBERTY a bunch of times to scare the bejeezus out of me, it appears to me that they are within their right to do so. I saw a number of people on Twitter decrying a “breach of trust”, and how it ruins the image of the npm ecosystem for a package author to do this. In my opinion, something is only a “breach” of trust when that trust is established two way and both parties are responsible for it. Package authors are not responsible for the one-way trust you placed in them, nor are they responsible for the effect their actions have on the reputation of the npm ecosystem.

The way Marak went about this was pretty “chaotic evil” for my tastes, and I don’t personally appreciate that they broke the trust so many people placed in them. However, I think a large number of us have forgotten that the people we download all these gigantic dependencies from are not in any way obligated to maintain a relationship of trust, and on paper could go nuclear at any time. The modern software supply chain has led to us unknowingly permitting this at an unprecedented scale for decades.

You are being ridiculous

Yeah, sort of. I am being pretty “doom and gloom” on purpose to frame this crucial portion of the software supply chain in a particular way. Let’s come back down to reality for a bit to talk about what all of this means for well meaning software developers just trying to get their job done.

How to improve npm safety (while still using npm)

I should preface this section by clarifying that I’m relatively new to this problem at scale, and there are experts far smarter than myself working to solve and educate on these problems. I’d still like to close this article off with tips that I have for developers who are worried about how to protect themselves further in the future.

I imagine there are a number of you reading this who are upset that I am appearing to suggest every line of open source code you pull down be audited. We all know that’s really not feasible at any scale larger than “demo”. This is why there are so many tools, such as sonarqube, snyk, and every project’s most diligent contributor dependabot (I am not affiliated with any, just a few I’m familiar with) built with features to track and audit dependencies you’ve brought in that may contain vulnerabilities. However, these tools don’t necessarily help if you’ve accidentally pulled in a bad dependency during development.

When a package is published on npm, save for select circumstances, the version published is there in perpetuity unless npm decides to take it down. Even though faker@6.6.6 which essentially deletes all of its code is published on npm, it does not remove the history of faker releases. Code can only be unpublished from the registry if the package has no dependents, which faker had a number of. In this case, and the case of colors@1.4.44-liberty-2, npm provides the tools to protect against these releases if you are a direct dependent.
If you are a newer developer, I recommend understanding semantic versioning as fully as you can; it is one of the greatest defenses to much of what I’ve mentioned in this article. The most common practice when using semantic versioning is to use the ^ caret prefix on most of your dependencies because this is what npm does by default when installing a new dependency. It means that any updates to the major version will not be installed, but the latest release of that major version will be used. Similarly, the ~ tilde prefix will not allow any updates to the minor version, so only patch releases are picked up. Providing no prefix will pin a dependency at a particular version. If you aren’t already, it is highly recommended to use more discretion when deciding which prefix to use on new and existing dependencies you choose to bring in.
An important caveat here is that even people who were more reserved by only allowing patch releases of colors, which should suggest only bringing in bug/vulnerability fixes, still got screwed here by unexpectedly allowing a very breaking change. However, this defense is still good against typical benign cases.
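To make the prefix rules above concrete, here is a rough sketch of how the three styles behave. This is not npm’s actual implementation: the function names (parseVersion, compareVersions, satisfies) are made up for illustration, and it assumes plain MAJOR.MINOR.PATCH versions with a major version of 1 or higher. Real ranges have more rules for 0.x versions and pre-release tags.

```javascript
// Simplified sketch: only handles plain MAJOR.MINOR.PATCH versions with MAJOR >= 1.
// Real npm ranges (pre-release tags, 0.x versions, wildcards) have more rules.
function parseVersion(version) {
	const [major, minor, patch] = version.split(".").map(Number);
	return { major, minor, patch };
}

// Returns a negative number, zero, or a positive number, like a sort comparator.
function compareVersions(a, b) {
	return a.major - b.major || a.minor - b.minor || a.patch - b.patch;
}

// Hypothetical helper: does `version` fall inside `range` ("^1.4.0", "~1.4.0", or "1.4.0")?
function satisfies(version, range) {
	const prefix = range[0] === "^" || range[0] === "~" ? range[0] : "";
	const base = parseVersion(range.slice(prefix.length));
	const candidate = parseVersion(version);

	if (prefix === "") {
		// No prefix: pinned to exactly one version.
		return compareVersions(candidate, base) === 0;
	}
	if (compareVersions(candidate, base) < 0) {
		// Never install anything older than the stated version.
		return false;
	}
	if (prefix === "^") {
		// Caret: minor and patch may move, major may not.
		return candidate.major === base.major;
	}
	// Tilde: only patch may move.
	return candidate.major === base.major && candidate.minor === base.minor;
}

console.log(satisfies("1.5.0", "^1.4.0")); // true
console.log(satisfies("2.0.0", "^1.4.0")); // false
console.log(satisfies("1.4.2", "~1.4.0")); // true
console.log(satisfies("1.5.0", "~1.4.0")); // false
console.log(satisfies("1.4.1", "1.4.0")); // false
```

Under these assumptions, the tilde case is the one that matters for the caveat above: a dependent on ~1.4.0 still accepts any later 1.4.x patch release automatically.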

The issue with increased discretion is it usually means you have to do more manual work when it’s time to update. Working in the Node.js ecosystem is implicitly accepting that everything moves incredibly fast, and it’s a danger to your application’s continued health to let things fall too far out of date. While far from a perfect solution, one of my favourite ways to combat this is npm-check-updates. It provides an optional interactive environment to select updates to packages that you feel confident are safe. It is a nice convenience in a process that haunts Node.js developers everywhere.

The sad truth is that there probably is no true way to stop this from affecting you. Semantic versioning is the biggest help when you are a direct dependent of the code you are trying to control. Unfortunately, npm package dependency graphs can go many layers deeper than we bargain for. If you pulled in even one odd dependency that doesn’t pin some sub-dependency nicely, and the sub-dependency becomes problematic, you could have an issue that you often can’t directly do anything about. This can lead to a frustrating amount of work, and what feels like a lack of control over your codebase if it happens often. For this one, I don’t have a great solution. I wish I did, because it’s a problem I have had for most of my time in the industry. I wouldn’t want to say that a great solution doesn’t exist somewhere, but it’s probably going to be a burden we have to bear in the Node.js ecosystem to remain safe and secure. The biggest piece of advice is to make sure you are controlling your direct dependencies as tightly as possible the more strict your security requirements are; even when a dependency pulls in a bad sub-dependency, you can protect against the ripple effect if you keep tight control over when you bring the direct dependency in.

Conclusion

I think what happened with colors and faker is a fascinating case study into how many of us have become complacent with npm’s hidden safety concerns. I love open source software, and I believe we can all do our part to ensure we use it safely. I hope this article provided a new perspective to the situation, and whether you agree or disagree feel free to reach out and discuss! I am interested to hear about your experiences.

The death of the for loop?

Sat, 13 Mar 2021 00:00:00 +0000

NOTE (Feb 15, 2025): I think this post kinda sucks and I largely disagree with a majority of it now. I’ve decided to keep it here for posterity, but my modern sensibilities no longer line up with what I wrote here.

Generally introduced to new developers around chapter 4 or 5 of their proverbial Intro to Computer Science books, loops are one of the most fundamental coding constructs a developer learns. The different simple ways we iterate over collections of data are often the core of the most complex applications ever built. This is to dramatically justify the probably-overkill rant I am about to write regarding iterating over a collection of data.

Truthfully, this title is a misnomer. I don’t think for loops need to die. My goal with this post is to present a case for the available alternatives to traditional for loops. Though they aren’t technically wrong, I hope to demonstrate the benefits of the alternatives and how I believe they contribute to the enhancement of code quality. (Ha, my first clickbait title. Unfortunately, this site earns me nothing. Your click paid me $0.)

I will use JavaScript for the code examples since types aren’t going to be a factor here, and I feel it’s the simplest language to convey the concepts in the post. I will stay as language-agnostic as possible.

The traditional `for` Loop

I’ll start by laying out the traditional for loop everyone knows and loves. While learning the fundamentals of coding you will write your first for loops like this.

for (let i = 0; i < 10; i++) {
	// Code to execute on each iteration
}

I still think this should be the first loop a new developer learns. It’s easy to understand; execute the code inside the braces 10 times. Once the developer gets to arrays, and they learn that arrays are addressed 0, 1, 2 etc. to retrieve data, the use case of for loops suddenly clicks:

const arr = [1, 2, 3, 4, 5];
for (let i = 0; i < arr.length; i++) {
	console.log(arr[i]);
}

“The loop runs from 0 to 4, so I can use i to choose an item from the array on each iteration of the loop! Now I understand what loops are for!” - A dramatic re-enactment of me getting to the array lesson in my first coding book, feeling like a genius.
It’s understandable why a developer would reach for this by default; any professional developer is certain to have written hundreds of for loops exactly like this throughout their career, so there is rarely anything new for a developer to learn or understand.

Why fix what isn’t broken?

`foreach` loops

How many times have you seen a loop like this?

const arr = [1, 2, 3, 4, 5];
for (let i = 0; i < arr.length; i++) {
	const value = arr[i];
	// do stuff with value
}

This code isn’t inherently wrong, but doesn’t it seem like a waste to write a traditional for loop just to assign the value to a local constant variable every time? We don’t actually need to modify the source collection, we just need to iterate through it and read each value individually.
Enter the foreach loop. This style of loop cuts out that boilerplate step that assigns a local constant in each iteration of your loop. Instead of each loop iteration having an index, each loop iteration will have an item. In JavaScript, this is implemented using the for...of syntax.

const arr = [1, 2, 3, 4, 5];
for (const value of arr) {
	// do stuff with value
}

In my opinion, this second option is a lot cleaner. Using this new syntax, we keep our code more concise and focused by demonstrating our intentions with the data through the syntax.
This is a microcosm of what you’ll see in the rest of this post; while it’s not wrong to use a traditional for loop, we should find ways to be more explicit about our intention when we iterate through a collection.

Higher-order Functions

Rather than mince words to try and explain what a Higher-order Function (hereafter referred to as HOF) is, I will simply link its Wikipedia Article, as well as this chapter of Eloquent JavaScript since this article is largely in JavaScript.
Why are HOFs important to the goals of this post? This fundamental construct unlocks a number of elegant ways to use and transform collections of data with more specificity than is generally possible with native looping constructs.
Assuming that you have read the suggested articles or are already familiar with the required anonymous function syntax, let’s look at some HOFs that we can use to work with collections of data. While the examples will still be in JavaScript, nearly every modern language has some version of the methods we’ll discuss here.

`forEach`

We have learned about foreach loops, implemented in JavaScript as for...of. However, there’s a HOF to do essentially the same thing. Let’s restate the previous example here:

const arr = [1, 2, 3, 4, 5];
for (const value of arr) {
	// do stuff with value
}

The forEach function takes in an anonymous function whose first argument represents an individual element of the collection. So the above example could be refactored to this:

const arr = [1, 2, 3, 4, 5];
arr.forEach(
	value => {
		// do stuff with value
	}
);

When I see this function, I think “We are going to perform an operation that reads each element of the array individually” and I can focus on how we will use each element.
I started with the forEach function because it’s a great introduction to HOFs in general. However, to someone not sold on HOFs as a concept, this might look no better than a for...of loop. In truth, the justification for this goes deeper into the concept of immutability and side effects that are core to the Functional Programming Manifesto. (I capitalized that like it was a real book, but unfortunately it’s not. Lots of great reading if you search that exact phrase, though.)
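As a small illustration of the side effects point, here is a sketch (the variable names are just for this example) showing that forEach itself returns nothing, so anything useful it does has to happen by changing something outside the callback:

```javascript
const numbers = [1, 2, 3, 4, 5];

// forEach always returns undefined, so any useful work has to happen
// as a side effect, here by mutating a variable outside the callback.
let total = 0;
const returned = numbers.forEach(value => {
	total += value;
});

console.log(returned); // undefined
console.log(total); // 15
```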

In the interest of staying in scope for this post, let’s instead move ahead to some similar HOFs that I believe provide a clear new advantage.

`map`

map is designed to handle the scenario where we want to apply a transformation to every element of an array. For example, you may have a loop that builds the 2 times table.

const arr = [1, 2, 3, 4, 5];
const twoTimestable = [];
for (const value of arr) {
	twoTimestable.push(value * 2);
}
// twoTimestable = [2, 4, 6, 8, 10]

When another coder reads this, they will be able to tell that this is a loop to build a new collection of data based on each element of a source. Using the map function, we can instead specify that a new collection is a result of transforming the source’s elements individually.

const arr = [1, 2, 3, 4, 5];
const twoTimestable = arr.map(value => value * 2);
// twoTimestable = [2, 4, 6, 8, 10]

When I read the second example, I see the map function and instantly think “This is a new collection that is a transformation of the original”, and I can focus on what exactly the transformation is. That’s the important part anyway; the extra code that manages assigning the results to a new collection is simply boilerplate around what I would consider the unique behaviour of the program. It’s what makes this program special, if you will.
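
The same idea carries over to arrays of objects. Here is a rough sketch with made-up data (the `readings` array and its field names are hypothetical, not from any real API). Note the parentheses around the object literal: without them, the braces would be parsed as a function body rather than a returned object.

```javascript
// Hypothetical sensor data, for illustration only.
const readings = [
	{ sensor: "a", celsius: 20 },
	{ sensor: "b", celsius: 25 },
];

// Build a new array of new objects; the source objects are not modified.
const converted = readings.map(reading => ({
	sensor: reading.sensor,
	fahrenheit: reading.celsius * 9 / 5 + 32,
}));

console.log(converted[0].fahrenheit); // 68
console.log(converted[1].fahrenheit); // 77
console.log(readings[0].celsius);     // 20, the source is untouched
```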

`filter`

filter is for when we need specific data out of a collection. Let’s say we want an array containing only the elements of our source that are divisible by 3.

const arr = [1, 3, 6, 8, 12];
const divisibleBy3 = [];
for (const value of arr) {
	if (value % 3 === 0) {
		divisibleBy3.push(value);
	}
}
// divisibleBy3 = [3, 6, 12]

filter will give us similar benefits to map here. filter is a function that will produce a new collection that only contains the elements of source for which the specified function returns true. So the above example can be refactored to this:

const arr = [1, 3, 6, 8, 12];
const divisibleBy3 = arr.filter(value => value % 3 === 0);
// divisibleBy3 = [3, 6, 12]

When I see filter, I think “This will be a new collection of data that passes some criteria”, and then I can focus on the criteria. As with map, that’s what makes this program special.
In my eyes, this seems like the easiest HOF to sell. It is in my opinion the most intuitive because it’s the word we would probably use in plain English to describe what we are actually trying to do.
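
Because filter and map each return a new array, they can be chained. As a small sketch (the variable name is my own), this keeps only the multiples of 3 and then doubles them, with each step reading as its own intention:

```javascript
const arr = [1, 3, 6, 8, 12];

// First narrow the collection down, then transform what is left.
const doubledMultiplesOf3 = arr
	.filter(value => value % 3 === 0)
	.map(value => value * 2);

console.log(doubledMultiplesOf3); // [6, 12, 24]
```

Each call in the chain allocates an intermediate array, which is usually fine for small collections but worth keeping in mind for very large ones.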

`reduce` (traditionally known as `fold`)

This one may be the hardest to sell of the 4 HOFs we’re exploring.
reduce is used when we want to take the elements of a collection and derive some final result from them. A good basic example would be calculating the sum of all elements in an integer array.

const arr = [1, 2, 3, 4, 5];
let sum = 0;
for (const value of arr) {
	sum += value;
}
// sum = 15

While the idea of reduce is simple to explain, its usage is a bit harder to wrap your head around at first. The previous HOFs accepted an anonymous function with a single argument (which represents the “current element”, so to speak), but the anonymous function we pass into reduce takes 2: the running value, known as the “accumulator”, and the current element (like the previous examples). This function returns the new value for the accumulator after handling the current element. In this example, the “accumulator” will be the sum we’re calculating. We seed the accumulator with an initial value (in this case 0) by passing it as the second argument to the outer reduce function.

const arr = [1, 2, 3, 4, 5];
const sum = arr.reduce(
	(sum, value) => sum + value,
	0
);
// sum = 15
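
If the accumulator still feels abstract, it can help to record what happens on each call. This is only a tracing sketch (the `steps` array and the `acc` name are mine), but it shows the return value of one call becoming the accumulator of the next:

```javascript
const arr = [1, 2, 3, 4, 5];
const steps = [];

const sum = arr.reduce((acc, value) => {
	const next = acc + value;
	// Record accumulator, current element, and the value handed to the next call.
	steps.push(`${acc} + ${value} = ${next}`);
	return next;
}, 0);

console.log(steps);
// [ '0 + 1 = 1', '1 + 2 = 3', '3 + 3 = 6', '6 + 4 = 10', '10 + 5 = 15' ]
console.log(sum); // 15
```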

Building a result that combines all the elements in a collection is a great use of reduce, but it can also be good for finding an element in a collection based on some criteria relative to the other elements in a collection. For example, if we wanted to find the max element in an array with a traditional for loop it would look something like this:

const arr = [1, 2, 5, 4, 3];
let max = -Infinity; // not Number.MIN_VALUE: that is the smallest positive number, so it breaks for all-negative arrays
for (const value of arr) {
	if (value > max) {
		max = value;
	}
}
// max = 5

The name “accumulator” becomes a slight misnomer in this scenario, because rather than being an accumulation of all the values, it is simply the end result we are interested in. Ignoring that, the reduce we write is pretty similar to the earlier example:

const arr = [1, 2, 5, 4, 3];
const max = arr.reduce(
	(max, value) => {
		if (value > max) {
			return value;
		}
		return max;
	},
	-Infinity
);
// max = 5

You might be thinking “you silly goose, this is more lines than the original for!”
Correct, I have bamboozled you to demonstrate a common gotcha when writing these HOF argument functions: they need to return a value. An arrow function written without braces implicitly returns the value of its expression, but once you add braces you need an explicit return. If you go to write your first reduce and wonder why the heck it’s not working, the first thing to check is that all code paths return a value.
This example can be written as a one-liner using a ternary expression:

const arr = [1, 2, 5, 4, 3];
const max = arr.reduce(
	(max, value) => value > max ? value : max,
	-Infinity
);
// max = 5
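
To see the missing-return gotcha in action, here is a deliberately broken sketch next to a working one (the names `broken` and `fixed` are mine). When a code path falls off the end of the callback, the accumulator silently becomes undefined for the next call:

```javascript
const arr = [1, 2, 5, 4, 3];

// Broken on purpose: nothing is returned when value <= max,
// so the accumulator turns into undefined after the 5.
const broken = arr.reduce((max, value) => {
	if (value > max) {
		return value;
	}
}, -Infinity);

// Every code path returns a value.
const fixed = arr.reduce((max, value) => {
	if (value > max) {
		return value;
	}
	return max;
}, -Infinity);

console.log(broken); // undefined
console.log(fixed);  // 5
```

No error is thrown in the broken version, which is what makes this one easy to miss.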

When I see a reduce, I think “This will take the source collection and build some kind of result out of it”, and I can focus on what it needs to do to find that result.

This is so sad, I love `for` loops. There must be some use for them!

Fear not! HOFs are awesome, and in a pure functional language like Haskell you would be using them all the time. However, if you are not living in the Pure Functional Utopia, there are still some great uses for traditional for loops.

Modifying the source collection

This post has so far assumed that you are only reading from the source collection and producing a new result. I always strive not to modify values in code; it’s so nice to always know with certainty what everything in your program is going to contain/equal. However, if for whatever reason you are required to modify a collection in place, a traditional index for loop is still the best way to cleanly do so.

const arr = [1, 2, 3];
for (let i = 0; i < arr.length; i++) {
	arr[i] = 69;
}
// arr = [69, 69, 69];
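
As a rough illustration of why I treat in-place modification with care, consider what happens when something else holds a reference to the same array (the `alias` and `mapped` names are just for this sketch). The index loop changes what every holder sees, while map leaves the source alone:

```javascript
const arr = [1, 2, 3];
const alias = arr; // same array, not a copy

// In-place modification: anyone holding a reference sees the change.
for (let i = 0; i < arr.length; i++) {
	arr[i] = arr[i] * 10;
}
console.log(alias); // [10, 20, 30]

// map builds a new array and does not touch the source.
const mapped = arr.map(value => value + 1);
console.log(mapped); // [11, 21, 31]
console.log(arr);    // [10, 20, 30]
```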

Comparing elements directly to previous or future elements

If you are in a scenario where you need to take different action based on elements before/after the current element, the traditional index for loop is going to be your best bet.

const arr = [1, 1, 1, 2, 2, 3];
for (let i = 0; i < arr.length; i++) {
	// when i is 0, arr[i - 1] is undefined, so the first element takes the else branch
	if (arr[i] === arr[i - 1]) {
		// do something 
	} else {
		// do something different 
	}
}
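
For a concrete version of the above, here is one possible sketch that collapses runs of consecutive duplicates (the `collapsed` name and the task itself are my own example, not a standard recipe). The explicit `i === 0` check keeps the first element from being compared against a nonexistent predecessor:

```javascript
const arr = [1, 1, 1, 2, 2, 3];
const collapsed = [];

for (let i = 0; i < arr.length; i++) {
	// Keep an element only if it starts a new run.
	if (i === 0 || arr[i] !== arr[i - 1]) {
		collapsed.push(arr[i]);
	}
}

console.log(collapsed); // [1, 2, 3]
```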

You are writing Go

Let’s take a brief intermission from JavaScript to present a new perspective.
If you’re writing Go, you can probably throw everything I’ve said in this post out the window. The way Go handles loops is essentially the antithesis to what I’ve presented so far.
Not only is a for loop the only way to iterate through a collection of data, the for keyword even replaced the while keyword. This is in service of Go’s language design philosophy, which is (in brief) to standardize around one way to do things as much as possible (a philosophy I’d love to rant on in a future post).
Go does have a func type, meaning one could implement their own map function like so:

func Map(arr []int, transform func(int) int) []int {
	result := make([]int, len(arr))
	for i, value := range arr {
		result[i] = transform(value)
	}
	return result
}

func main() {
	arr := []int{1, 2, 3, 4, 5}
	twoTimesTable := Map(
		arr,
		func(value int) int { return value * 2 },
	)
	// twoTimesTable = [2, 4, 6, 8, 10]
}

(I wrote this myself, but would not have been able to without this post from Algorithms to Go.)

The beauty and curse of Go is that it’s up to the developer to implement this if they want it. I don’t know any Go developers personally, but I imagine some would turn their nose up at this while others would welcome it. If you happen to be a Go developer, I would love to hear your thoughts on this!

You are writing C

Just use for loops.

Conclusion

You may read this post and think to yourself “this guy is stupid, it’s more readable to just use a for loop”.
You might be right given the context of the code you’re writing. “Readability” is pretty subjective, and some people may prefer to see the boilerplate that comes with the for loop examples I presented here. My goal with this post was to present my perspective; I love the way HOFs describe explicit intentions for an iteration through a collection, allowing me to focus on the part of the code that matters.

I hope that you enjoyed reading this post! This is my first public blog post and I’m pretty nervous to show it to the world, but I really hope you gained something out of it and did not leave in an unbridled rage. Please follow my socials if you enjoyed this post and want to read more of my ramblings!

Projects

Here is a quick list of my personal projects, both previous and active! Most of them are MIT licensed, with a couple exceptions.

Tools and Libraries

yamlfmt https://github.com/google/yamlfmt

Language: Go

A command line yaml formatting tool, also structured as a library for extensibility or custom wrappers.

This is my largest open source success. The tool has over 1k GitHub Stars, and each release gets tens-to-hundreds of thousands of downloads.

go-utf8-codepoint-converter https://github.com/RageCage64/go-utf8-codepoint-converter

Language: Go

Tool to convert UTF-8 codepoint text to the unicode character the text represents.

Fluent Bit Lua Tester https://github.com/RageCage64/flb_lua_tester

Language: Rust

Allows you to run Lua scripts meant for Fluent Bit scripting in a sanitized environment with specific input and expected output.

collections-go https://git.ragecage64.com/RageCage64/collections-go

Language: Go

A library that implements common data structures for Go with best possible time complexity and minimal allocations.

multilinediff https://git.ragecage64.com/RageCage64/multilinediff

Language: Go

A library to write multiline diff output to the command line.

Open Source Work

Most of my open source work is done under my Google GitHub profile: https://github.com/braydonk

OpenTelemetry https://github.com/open-telemetry

Language: Go

Contributing to OpenTelemetry in a couple of ways:

Codeowner of the hostmetricsreceiver in the OpenTelemetry Collector
Member of the System Semantic Conventions Working Group

Fluent Bit https://github.com/fluent/fluent-bit

Language: C

An open source observability agent, which we use on my team at Google as part of the Ops Agent. I have helped fix a number of bugs in Fluent Bit, and I do code reviews and maintenance on the out_stackdriver plugin.

Monkey https://github.com/monkey/monkey

Language: C

An HTTP server written in C. It is a crucial component of Fluent Bit, and I have done some work on this repo to support fixes in Fluent Bit, as well as adding unit tests to the repo.

YouTube

I do have a YouTube channel at https://www.youtube.com/@RageCageCodes-ik2ue. I only have one video as of writing. I was thinking I might make more, but I found making tutorial content not as exciting as I’d hoped. I’m keeping it on the backburner just in case!

Gaming

TrustFall https://git.ragecage64.com/RageCage64/TrustFall

Language: C++

A Root Beer Tapper ripoff that I wrote as a school project. Uses Allegro 5 because I had to (well technically I had to use 4 but I refused to do that and accepted the consequences).

SpaceForce https://git.ragecage64.com/RageCage64/SpaceForce

Language: C++

A SHMUP that I wrote also for a school project.

SeeNoEvil https://git.ragecage64.com/RageCage64/SeeNoEvil

Language: C#

My entry to the 8-bits-to-infinity game jam. I sadly did not save the assets, which is too bad but my partner who drew them insists they weren’t worth keeping. I thought they looked pretty good. :D
The main thing I want to extract out of this is the code that worked with Tiled, I thought it was reasonably sophisticated for something I coded in under a week. Would be cool to extract it into a standalone library.

Talks

This is a collection of public recorded talks I’ve done.

Prepared Talks

Deep Dive: How Fluent Bit Collects File Logs

https://www.youtube.com/watch?v=KrlvWBCGagI

This is my talk for Observability Day North America, a co-located event with KubeCon NA 2024. It was a lightning talk, but it ended up being a really dense talk and probably could have been full-sized. To compensate, I talked really fast!

T-shirt: Iron Maiden

Tuning OTel Collector Performance Through Profiling

https://www.youtube.com/watch?v=qMxxjB4meXo

This was a talk for OpenTelemetry Community Day 2024. It goes through my experience profiling parts of the OpenTelemetry Collector to find performance improvements.

Retraction: One of the solutions I presented in this talk, for getting the Parent Process ID on Windows, had a flawed premise: it ignored the fact that the increase in WMI memory usage offset the gains made in the Collector. So it ended up not being that big of a win, and we’re still working to find an alternative method for getting the Parent Process ID.

T-shirt: Brook from One Piece Wanted Poster

How Much Overhead: How to Evaluate Observability Agent Performance

https://www.youtube.com/watch?v=BIaftvtFPHg

This is my talk for Observability Day 2023, a co-located event with KubeCon NA 2023. It was inspired by situations at work where people would ask things like “which agent has less overhead?” without fully qualifying their goals. I wanted to break the problem down into more actionable pieces.

T-shirt: Meshuggah Catch-33

Learning To Fly: How to Find Bottlenecks in your Agents

https://www.youtube.com/watch?v=jf7t1CpoKlg&t=176s

This was a remote talk I did for the Is It Observable YouTube channel (awesome channel, highly recommend subscribing). This was perhaps the hardest I ever prepared for a talk, because it came with an in-depth reproducible demo that ran in a Dockerfile and included code to graph OpenTelemetry Metrics directly in the CLI. It was a lot of fun to prepare and I think it’s one of my best talks (if you can get around the fact that my mic sounded TERRIBLE).

T-shirt: Zoro from One Piece

Background friend: A plush of the character Acrid from my favourite video game, Risk of Rain 2

Tutorials

5 Levels of Go Error Handling

https://www.youtube.com/watch?v=y5utZCeHys0&t=1s

This was my one attempt at “content creation”. It’s a relatively beginner-focused tutorial about Go error handling and how to do some more advanced things. The video was picked up by the algorithm this past summer and started getting a lot more attention. I’m not completely cutting myself off from making more videos in the future, but I did not have as much fun as I thought I would making this video so I’m not sure if I’ll make more. I think this tutorial is pretty good for what it is though and I’ll keep it around anyway!

T-shirt: It’s obscured!

Background friend: Acrid from Risk of Rain 2 again

Interviews

KubeCon NA 2024 with Is It Observable

https://www.youtube.com/watch?v=qf0OjAEzprs&t=365s

This was an interview with the Is It Observable YouTube channel. I talked a bit about the talk I was giving the next day at Observability Day, as well as some general best practices for managing performance of agents collecting logs.

T-shirt: Coheed and Cambria, Vaxis II tour shirt

Humans of OTel - KubeCon NA 2024

https://www.youtube.com/watch?v=TIMgKXCeiyQ

I was featured in the Humans of OTel series of interviews at KubeCon NA 2024, along with a lot of amazing peers from the OpenTelemetry Community!

T-shirt: Video filmed too high to tell!

KubeCon NA 2023 with Is It Observable

https://www.youtube.com/watch?v=5arixRhAIbs&t=161s

An interview with Is It Observable from KubeCon NA 2023. This was my first time doing something like this so I was definitely more nervous, but it was great practice!

T-shirt: Meshuggah Catch 33
