Elephant Factor is defined as the minimum number of companies whose employees perform 50% of the total contributions to a project.
From the previous post, we know that the Bus Factor is “a measurement of the risk resulting from information and capabilities not being shared among team members”. Elephant factor is the same concept, applied to companies.
Diversity of organizations do not guarantee more contributions, but more resilient communities. The worst scenario would be, of course, depending only on a single company that at some point leaves the project. In that case, the project could be difficult to maintain and in the worst case, the project could be discontinued. However, this is a typical scenario when a company creates and funded a project. Making their community grow after them is the actual challenge.
Following with the community layer that we started to analyze in the previous post about the Bus Factor, this post tries to shed some more light on the Organizational Diversity of the four projects we had the opportunity to analyze: Libre Health, Mifos, Open Data Kit, and Hot.
Before starting, it’s worth noticing the importance of the affiliations for this analysis. The dashboard uses a basic domain matching algorithm to set contributor affiliations. In some cases, the system doesn’t know which company corresponds to a given domain, or some people use personal e-mail addresses instead of corporate ones. The dashboard allows us to improve the affiliations data through a tool called Hatstall where we can merge several profiles into the same one, add affiliations for profiles, mark profiles as bots, and some other similar operations. Please don’t hesitate to contact us if you need help to improve current data to make results more reliable. To have a glance at current affiliations data, please visit the Affiliations Dashboard.
Based on the data we have, there are some considerations to take into account:
Hot is the most reliable source, less than 20% of contributions belong to people affiliated to ‘Unknown’.
Libre Health and Mifos are around 50% of contributions made by people affiliated to
Unknown. Their analysis could be biased by this fact. 50% of contributions is still a reasonable number to provide a view of the actual organizational diversity of the projects
Open Data Kit is a reliable source. Even having most of the people affiliated to ‘Unknown’, only around 30% of contributions were made by those people.
The following are the numbers (A) for the Elephant Factor calculated on the 21st of November 2019 and for the last year. While the following ones (B) were retrieved today for the last year using the OSCaaS Dashboard. There is certain overlapping time, but this helps to see the evolution of these numbers.
It is especially interesting to see how Mifos changed in such a short period of time. Conflux Technologies and Mifos were the two organizations leading contributions in the first period. However, currently, The Apache Software Foundation is contributing more than 50% of the total contributions by themselves. The rest of the projects are being maintained mainly by a single organization, except Open Data Kit. As we’ll see later on, this is not an isolated fact when talking about open source projects.
Nevertheless, the goal of any open source community should be to involve more and more people under its umbrella. The next step for a community or/and project manager would be looking at the data to find out which other organizations are contributing to the project and plan specific actions to engage them and increase their commitment to the project.
As in the previous post, we run another analysis to bring more context to the discussion (C). Wikimedia Foundation, OPNFV, GitLab, and Kata Containers were selected again to make the data comparable. As a reminder, there are huge projects as Wikimedia and smaller ones as Kata Containers. We show the data for the last year only just because there was no variation between the two periods we analyzed:
As you can see, having a single company or organization leading a project is something usual. That could mean that company is very interested in the project, although most of the times it is probably due to that organization created and/or funded the project. Along these lines, for an open-source community, it would be great not to rely on a single company, and that’s why in certain cases big companies decide to invest in projects they are especially interested in.
Remember, if you are interested in running a similar analysis for your project and you are part of the OSC Hub and you want your project to be part of further analysis, please reply to this post and let us know!