Third-party Identity Management Usage on the Web

A. Vapen, N. Carlsson, A. Mahanti, and N. Shahmehri, "Third-party Identity Management Usage on the Web", Proc. Passive and Active Measurement Conference (PAM), Los Angeles, CA, Mar. 2014. (pdf)

Abstract: Many websites utilize third-party identity management services to simplify access to their services. With privacy and security implications for the end users, an important question is how websites select their third-party identity providers and how this impacts the characteristics of the emerging identity management landscape seen by the users. In this paper we first present a novel Selenium-based data collection methodology that identifies and captures the identity management relationships between sites and the intrinsic characteristics of the websites that form these relationships. Second, we present the first large-scale characterization of the third-party identity management landscape and the relationships that makes up this emerging landscape. As a reference point, we compare and contrast our observations with the somewhat more understood third-party content provider landscape. Interesting findings include a much higher skew towards websites selecting popular identity provider sites than is observed among content providers, with sites being more likely to form identity management relationships that have similar cultural, geographic, and general site focus. These findings are both positive and negative. For example, the high skew in usage places greater responsibility on fewer organizations that are responsible for the increased information leakage cost associated with highly aggregated personal information, but also reduces the user's control of the access to this information.

Datasets

The datasets used in our paper are made available here for use by the wider research community. Please refer to Section 2 of our paper for a description of the data collection methodology and a summary of the datasets.
Note: If you use our datasets in your research, please include a reference to our PAM 2014 paper (pdf) in your work.