Pronoun usage in the Linux Kernel
Published:
I analyzed the distribution of pronouns in the 6.13 Linux kernel source. Unsurprisingly, I find that male pronouns are common, but this makes female pronouns all the more unusual.
Results
Counts by pronoun
Term | Count |
---|---|
he/him/his | 2516 |
she/her/hers | 33 |
man | many references to man |
men | 3 |
wom{a,e}n | 4 |
person | many license usages |
people | 779 |
The count for “person” is tainted by all of its usages in license verbiage. Similarly, “man” is polluted with all the references to “man pages”.
I pruned non-pronoun usages. I did not include uses of “they” or “them” because I cannot grep to distinguish between uses of the gender-neutral pronoun “they” or as the plural for neuter terms (such as referring to function calls or variable values).
Pronoun usage over time
Analysis
The kernel overwhelmingly uses male pronouns. This result is expected due to the >90% male contributor population. Bitergia looked at the contributions of men and women to the Linux kernel in 2016. Their study found that women produced 6.8% of git activity as 9.9% of the developer population. This analysis was based on identifying gender from names.
I expected there to only be uses of male or gender-neutral phrases. This hypothesis led me to inspect every usage of female pronouns.
Excluding some gender-neutral uses of “he/she”, “his/her”, and “[s]he”, the results for grepping “her” are peculiar.
“her” usage
noun which the pronoun refers to | notes |
---|---|
female student | A didactic quote with a female student [commit] |
user of the computer | unsure of authors as commits are from before the kernel switched to git [commit 1] and [commit 2] |
a disk transaction | [commit] |
busy waiter on a lock | [commit] |
“she” usage
noun which the pronoun refers to | notes |
---|---|
scientist user | [commit] |
code author | [commit] |
user | [commit] |
parody song quote | David Bowie quote [commit] |
the GPU? | This commit, by the esteemed Alyssa Rosenzweig, was the only commit using female pronouns which I could attribute to a woman. [commit] |
“woman”/”women”
The only reference to a “woman” in the kernel is in a eulogy
I would ask that anyone benefiting from this work, especially those
using it in commercial products, consider making a donation to my local
non-profit hospice organization in the name of the woman I loved, who
passed away Feb. 12, 2003.
And the only usage of “women” is in contribution guidelines documentation.
Swearing in Linux
This paper shows there is a correlation between swear word usage and code quality. In unrelated news, there is a large, negative trend line for the amount of swearing in Linux over the past ten years.
Age
The kernel continues to evolve as new developers come in. The amount of contributors joining and staying on the Linux project keeps dropping.
For example, efforts to push Rust into Linux have been causing controversy. Eventually, Linus himself will have to step down. Being a maintainer is stressful, and even maintainers are human.
Further, contributions are done less than ever by open source hackers. Linux contributions are becoming more commercial, which means that contributors’ incentives may not align with that of the Linux project as a whole. Over 90% of contributors are being paid by their employer to submit patches. Nobody wants to pay to build the infrastructure to make Linux development better (ex: Coccinelle) or to write high quality documentation. These create barriers to the natural growth of the project among younger developers and other demographics.