Workshop Report: RFC 9307: Report from the IAB Workshop on Analyzing IETF Data (AID) 2021
The IETF as an international Standards Developing Organization hosts diverse data on the history, development, and current activities in the development and standardization of Internet protocols and its institutions. A large portion of this data is publicly available, yet this data is arguably underutilized as a tool to inform the work in the IETF and research on topics like Internet governance and trends in ICT standard-setting.
This workshop aims to enable engineers and researchers alike to mine the IETF’s data sources in order to explore trends through the analysis of IETF data, such as email archives, I-Ds, RFCs, and the datatracker. This work can be used to derive insights into the inner workings of the process of standardization, participation, and governance1. This workshop aims to bring together people who have already analyzed IETF data, those who are interested in the analysis of IETF data, and those who are interested in the results of such analysis as input for improvement of the IETF’s work.
We invite the research community, IETF participants, and others with an interest in the data collected by the IETF, its protocols, and participants, to submit a contribution to the workshop. Furthermore, we also welcome participants who are interested in the analysis that could be performed based on this data as well as those contributing considerations regarding future collection and handling of IETF data.
Possible avenues for explorations include, but are not limited to:
- What are patterns for participation in the IETF (what are predictors for a long and productive tenure, when do people stop participating, what is needed to successfully produce RFCs)?
- How is the IETF community developing (i.e., affiliations, publications, language, nationality, leadership positions)?
- How do affiliations develop in the IETF (i.e., does a change in affiliation translate into a change in behavior, is there a relation between affiliation and leadership positions and/or centrality, what is the affiliation distribution per area and/or WG)?
- What social dynamics (gender, nationality, income, occupation, and other social dynamics) are not captured by IETF data and what data and research approaches are needed to develop further insights in the social dynamics of standardization?
- How productive and effective is the IETF, with respect to documents, pages, words, letters and in comparison the overall activities e.g. on mailing lists?
- How well is the outcome of the IETF used, e.g,. based on references to RFCs in research papers, product manuals, or other sources?
- What data would be relevant to collect that is not collected yet or what should be considered with respect to handling of personal data during the data collection and research.
- How effective is the IETF’s consensus-based decision making process? Is there evidence that documents receive broad and effective reviews? Are experts with relevant expertise engaging with developing standards in a timely manner?
Participation and Submission
People interested in participation are requested to submit short position papers (500-1000 words). The paper can cover one or multiple of the following points, but this list should not be considered exhaustive:
- Research questions and interests in IETF data; indication which question should be answered, the data needed to do so, and how these insights could be used to improve processes and operations;
- Description of the IETF data they aim to analyze or the information they would like to see made available to inform their work (such as mailing list archives, or participation data obtained through the datatracker) and their methods for doing so (see footnote 1);
- Potential and preliminary findings; and how those insights could either benefit leadership, WG chairs, and authors/participants, and/or society and industry at large;
- Potential or preliminary findings and how those add novel insights to ongoing academic debates.
Proposals for data analysis should also contain a brief consideration of any related ethics and privacy issues. The basic principles of ethical research are outlined in the Belmont Report2 (covering e.g., respect for persons, beneficence, and justice) and/or institutional ethics guidelines.
The workshop will be invitation-only. The organizers will decide whom to invite based on the submissions received. Therefore, please indicate your interest by submitting a research proposal by September 29, 2021 to email@example.com.
The Program Committee members are Niels ten Oever (chair, University of Amsterdam), Colin Perkins (chair, IRTF, University of Glasgow), Corinne Cath (chair, Oxford Internet Institute), Mirja Kühlewind (IAB, Ericsson), Zhenbin Li (IAB, Huawei), Wes Hardaker (IAB, USC/ISI).
All inputs submitted and considered relevant will be published on the workshop web page. Sessions will be organized according to content, and not every accepted submission or invited attendee will have an opportunity to present as the intent is to foster discussion and not simply to have a sequence of presentations.
Position papers from those unable to attend in person are encouraged. A workshop report will be published afterwards.
- Submissions Due: 29 September 2021
- Invitations Issued by: 15 October 2021
- Workshop Date: November 29 – December 2 2021
- Location: Online
The workshop will consist of three parts:
- opening workshop (Monday)
- hackathon (Tuesday – Thursday morning)
- closing event (Thursday afternoon)
Feel free to contact the program committee with any further questions (including questions related to available data or expected outcomes): firstname.lastname@example.org
All times are UTC.
Monday, Nov 29 (Video)
- 14:00: Opening (Ten Oever)
- 14:15: Tools, data, and methods (Arkko)
- 15:00: Observations on affiliation and industry control (Cath)
- 15:45: Community and diversity (Hardaker)
- 16:30: Break
- 17:00: Publications, process, and decision-making (Perkins)
- 17:45: Taking stock: questions and hacking groups (Kühlewind)
- 18:30: Closing (Cath)
- 19:00: End of Day
Tuesday, November 30
- 14:00-18:00: Hackathon
- 16:00-16:30: Sync-up
Wednesday, December 1
- 14:00-18:00: Hackathon
- 16:00-16:30: Sync-up
Thursday, December 2 (Video)
- 14:00: Opening (Li)
- 14:15: Environmental Sustainability – initial results (Perkins)
- 14:45: Break
- 15:15: Result presentation (Ten Oever)
- 17:30: Wrap-up (Cath and Kühlewind)
- 18:00: End of Day
Session: Tools, data, and methods
- Using Complex Systems Analysis to Identify Organizational Interventions (Sebastian Benthall)
- The ietfdata Library (Stephen McQuistin, Colin Perkins)
- The RFC Prolog Database (Marc Petit-Huguenin)
- Observations about IETF process measurements (Jari Arkko)
Session: Observations on affiliation and industry control
- Competition for Leadership Positions in Standards Development Organizations (Justus Baron, Olia Kanevskaia)
- Analyzing IETF Data: Changing affiliations (Nick Doty)
- Position Paper (Don Le)
- Research Proposal (Elizaveta Yachmeneva)
Session: Community and diversity
- Characterizing the IETF through its consensus mechanisms (Priyanka Sinha, Michael Ackermann, Pabitra Mitra, Arvind Singh, Amit Kumar Agrawal)
- Would feminists have built a better internet? (Mallory Knodel)
- Identifying temporal trends in IETF participation (Wes Hardaker, Genevieve Bartlett)
- Who is the Average IETF Participant? (Lars Eggert)
- Representation is Not Sufficient for Selecting Gender Diversity (Emanuele Tarantino, Justus Baron, Bernhard Ganglmair, Nicola Persico, Timothy Simcoe)
Session: Publications, process, and decision-making
- Understanding Internet Protocol Design Decisions (Michael Welzl, Carsten Griwodz, Safiqul Islam)
- Characterising the IETF through the lens of RFC deployment (Ignacio Castro et al)
- The Impact of Continuity (Carsten Griwodz, Safiqul Islam, Michael Welzl)
- RFCs Change (Paul Hoffman)
- The Challenges of Cross-Document Coreference Resolution in Email (Xue Li, Sara Magliacane, Paul Groth)
- Project in time series analysis: e-mailing lists (Amelia Andersdotter)
- Position Paper (Mark McFadden)
Session: Environmental Sustainability – initial results
- Towards Environmental Sustainability with the IETF (Christoph Becker)
- CO2eq: Estimating Meetings’ Air Flight CO2 Equivalent Emissions: An Illustrative Example with IETF meetings (Daniel Migault)
1 Examples of such approaches are: https://www.arkko.com/tools/docstats.html, http://datactive.github.io/bigbang/, https://csperkins.org/research/protocol-standards/2020-12-10-ignacio-iesg-talk/2020-12-10_IESG-50-years-IETF-send.pdf, https://sodestream.github.io/impact-of-early-engagement-on-longevity-of-ietf-participation.html