Inferring Tracker-Advertiser Relationships in the Online Advertising Ecosystem using Header Bidding

Open access

Abstract

Online advertising relies on trackers and data brokers to show targeted ads to users. To improve targeting, different entities in the intricately interwoven online advertising and tracking ecosystems are incentivized to share information with each other through client-side or server-side mechanisms. Inferring data sharing between entities, especially when it happens at the server-side, is an important and challenging research problem. In this paper, we introduce Kashf: a novel method to infer data sharing relationships between advertisers and trackers by studying how an advertiser’s bidding behavior changes as we manipulate the presence of trackers. We operationalize this insight by training an interpretable machine learning model that uses the presence of trackers as features to predict the bidding behavior of an advertiser. By analyzing the machine learning model, we can infer relationships between advertisers and trackers irrespective of whether data sharing occurs at the client-side or the server-side. We are able to identify several server-side data sharing relationships that are validated externally but are not detected by client-side cookie syncing.

If the inline PDF is not rendering correctly, you can download the PDF file here.

  • [1] AppNexus Joins Comscore Industry Trust Comscore. https://www.comscore.com/ita/Public-Relations/Blog/AppNexus-Joins-comScore-Industry-Trust 2015.

  • [2] DoubleVerify Launches with the New AppNexus Spend Protection Program. https://www.doubleverify.com/newsroom/doubleverify-launches-with-the-new-appnexus-spend-protection-program 2015.

  • [3] General Data Protection Regulation (GDPR). https://gdprinfo.eu 2016.

  • [4] Personalization Delivers 3X Consumer Engagement With Digital Advertising Jivox. https://www.jivox.com/press/personalization-delivers-3x-consumer-engagement-with-digital-advertising/ 2016.

  • [5] OpenX Strengthens Product and Technology Teams with Key Hires OpenX. https://www.openx.com/company/press-releases/openx-strengthens-product-technology-teams-key-hires 2017.

  • [6] Rubicon Project Partners with Kiip to Automate Mobile In-App Rewarded Inventory Rubicon Project. http://investor.rubiconproject.com/news-releases/news-release-details/rubicon-project-partners-kiip-automate-mobile-app-rewarded 2017.

  • [7] Server-to-Server Header Bidding: The Pros and Cons The AppNexus Team. https://www.appnexus.com/blog/server-server-header-bidding-pros-and-cons 2017.

  • [8] The Economic Contribution of Digital Advertising in Europe IHS Markit. https://datadrivenadvertising.eu/wp-content/uploads/2017/09/DigitalAdvertisingEconomicContribution_FINAL-1.pdf 2017.

  • [9] Basic Attention Token (BAT): Blockchain Based Digital Advertising. https://basicattentiontoken.org/BasicAttentionTokenWhitePaper-4.pdf 2018.

  • [10] EasyList. https://easylist.to/easylist/easylist.txt 2018.

  • [11] Easyprivacy. https://easylist.to/easylist/easyprivacy.txt 2018.

  • [12] Server Postback Tracking Explained. https://help.tune.com/hasoffers/server-postback-tracking-explained 2018.

  • [13] The California Consumer Privacy Act of 2018. https://leginfo.legislature.ca.gov/faces/billTextClient.xhtml?bill_id=201720180AB375 2018.

  • [14] Tracking Cookies and ITP 2.0. https://support.partnerstack.com/hc/en-us/articles/360011902273-Tracking-Cookies-and-ITP-2-0 2018.

  • [15] Ad Tech Insights - Header Bidding Industry Index Adzerk. https://adzerk.com/assets/reports/AdTechInsights_Feb2019.pdf 2019.

  • [16] Alexa - Top Sites by Category: The top 500 sites on the web Alexa - An Amazon.com company. https://www.alexa.com/topsites/category 2019.

  • [17] AppNexus Third Party Providers OpenX. https://www.appnexus.com/third-party-providers 2019.

  • [18] Cookie Matching|Authorized Buyers Google. https://developers.google.com/authorized-buyers/rtb/cookie-guide 2019.

  • [19] Cookie Policy Automattic. https://automattic.com/cookies 2019.

  • [20] Firefox Lightbeam by Mozilla. https://addons.mozilla.org/en-US/firefox/addon/lightbeam/ 2019.

  • [21] Flattr - Contributors. https://flattr.com/contributors 2019.

  • [22] Ghostery makes the Web Cleaner Safer and Faster! https://www.ghostery.com/ 2019.

  • [23] Google Contributor. https://contributor.google.com/v/beta 2019.

  • [24] Is your pregnancy app sharing your intimate data with your boss? https://www.washingtonpost.com/technology/2019/04/10/tracking-your-pregnancy-an-app-may-be-more-public-than-you-think/ 2019.

  • [25] OpenX and Microsoft Announce Advertising Partnership OpenX. https://www.openx.com/company/press-releases/openx-and-microsoft-announce-advertising-partnership/ 2019.

  • [26] Privacy Badger Electronic Frontier Foundation. https://www.eff.org/privacybadger 2019.

  • [27] Server-side Tracking: General discussion and Common issues in Server-side Tracking Woopra. https://docs.woopra.com/docs/serverside-tracking 2019.

  • [28] Strategic Alliances - Index Exchange. https://www.indexexchange.com/alliances 2019.

  • [29] Surf The Web With No Annoying Ads. https://adblockplus.org 2019.

  • [30] US Digital Ad Spending Will Surpass Traditional in 2019 eMarketer. https://www.emarketer.com/content/us-digital-ad-spending-will-surpass-traditional-in-2019 2019.

  • [31] Why 2018 Was The Year Header Bidding Realized Its Potential AdExchanger. https://adexchanger.com/ad-exchange-news/why-2018-was-the-year-header-bidding-realized-its-potential 2019.

  • [32] G. Acar C. Eubank S. Englehardt M. Juarez A. Narayanan and C. Diaz. The Web Never Forgets: Persistent Tracking Mechanisms in the Wild. In ACM Conference on computer and Communications Security (CCS) 2014.

  • [33] A. Acquisti L. K. John and G. Loewenstein. What is privacy worth? The Journal of Legal Studies 42(2):249–274 2013.

  • [34] M. Backes S. Bugiel and E. Derr. Reliable Third-Party Library Detection in Android and its Security Applications. In ACM Conference on computer and Communications Security (CCS) 2016.

  • [35] M. A. Bashir S. Arshad W. Robertson and C. Wilson. Tracing Information Flows Between Ad Exchanges Using Retargeted Ads. In Proceedings of the 25th USENIX Security Symposium 2016.

  • [36] M. A. Bashir and C. Wilson. Diffusion of User Tracking Data in the Online Advertising Ecosystem. In Proceedings on Privacy Enhancing Technologies (PETS) 2018.

  • [37] J. Brookman P. Rouge A. Alva and C. Yeung. Cross-device tracking: Measurement and disclosures. Privacy Enhancing Technologies Symposium (PETS) 2017.

  • [38] I. Campbell. Chi-squared and Fisher-Irwin tests of two-bytwo tables with small sample recommendations. Statistics in Medicine 26(19):3661–3675 2007.

  • [39] J. P. Carrascal C. Riederer V. Erramilli M. Cherubini and R. de Oliveira. Your Browsing Behavior for a Big Mac: Economics of Personal Information Online. In 22nd International Conference on World Wide Web (WWW) 2013.

  • [40] T. Chen I. Ullah M. A. Kaafar and R. Boreli. Information Leakage through Mobile Analytics Services. In ACM Workshop on Mobile Computing Systems and Applications (HotMobile) 2014.

  • [41] D. Cvrcek M. Kumpost V. Matyas and G. Danezis. A study on the value of location privacy. In ACM Workshop on Privacy in Electronic Society (WPES) pages 109–118. ACM 2006.

  • [42] G. Danezis S. Lewis and R. J. Anderson. How much is location privacy worth? In Workshop on the Economics of Information Security (WEIS) 2005.

  • [43] A. Dey. Header Bidding vs RTB: Understanding the Differences. https://blognife.com/2018/09/08/header-bidding-vsrtb-understanding-the-differences/ 2018.

  • [44] W. Dou. Will Internet Users Pay for Online Content? Journal of Advertising Research 44 02 2005.

  • [45] S. Englehardt and A. Narayanan. Online Tracking: A 1-million-site Measurement and Analysis. In ACM Conference on Computer and Communications Security (CCS) 2016.

  • [46] J. Estrada-Jiménez J. Parra-Arnau A. Rodríguez-Hoyos and J. Forné. On the regulation of personal data distribution in online advertising platforms. Engineering Applications of Artificial Intelligence 82:13–29 2019.

  • [47] L. Fisher. Surveying The Digital Future The 2017 Digital Future Report Center for the Digital Future at USC Annenberg. http://www.digitalcenter.org/wp-content/uploads/2013/10/2017-Digital-Future-Report.pdf 2017.

  • [48] L. Fisher. US Programmatic Ad Spending Forecast Update 2018 eMarketer. https://www.emarketer.com/content/usprogrammatic-ad-spending-forecast-update-2018 2018.

  • [49] L. Fisher. Header Bidding Update 2018. What’s the Outlook for Web Mobile App and Video? https://www.emarketer.com/content/header-bidding-update-2018 2019.

  • [50] I. fouad N. Bielova A. Legout and N. Sarafijanovic-Djuki. Tracking the Pixels: Detecting Unknown Web Trackers via Analysing Invisible Pixels. In arXiv:1812.01514v2 2019.

  • [51] G. A. Fowler. It’s the middle of the night. Do you know who your iPhone is talking to? https://www.washingtonpost.com/technology/2019/05/28/its-middle-night-do-you-know-who-your-iphone-is-talking 2019.

  • [52] A. Ghosh M. Mahdian R. P. McAfee and S. Vassilvitskii. To match or not to match: Economics of cookie matching in online advertising. ACM Transactions on Economics and Computation 3 2015.

  • [53] J. González Cabañas A. Cuevas and R. Cuevas. FDVT: Data Valuation Tool for Facebook Users. In ACM Conference on Human Factors in Computing Systems (CHI) 2017.

  • [54] J. Grossklags and A. Acquisti. When 25 Cents is Too Much: An Experiment on Willingness-To-Sell and Willingness-To-Protect Personal Information. In Workshop on the Economics of Information Security (WEIS) 2007.

  • [55] L. Handley. Half of all advertising dollars will be spent online by 2020 equaling all combined ‘offline’ ad spend globally. https://www.cnbc.com/2017/12/04/global-advertising-spend-2020-online-and-offline-ad-spend-to-be-equal.html 2016.

  • [56] R. Hill. An efficient blocker for Chromium and Firefox. Fast and lean uBlock Origin. https://github.com/gorhill/uBlock#ublock-origin 2019.

  • [57] B. A. Huberman E. Adar and L. R. Fine. Valuating Privacy. IEEE Security & Privacy 3(5):22–25 2005.

  • [58] V. Kalavri J. Blackburn M. Varvello and K. Papagiannaki. Like a Pack of Wolves: Community Structure of Web Trackers. In International Conference on Passive and Active Network Measurement (PAM) 2016.

  • [59] A. Le J. Varmarken S. Langhoff A. Shuba M. Gjoka and A. Markopoulou. AntMonitor: A system for monitoring from mobile devices. In Proceedings of the 2015 ACM SIGCOMM Workshop on Crowdsourcing and Crowdsharing of Big (Internet) Data pages 15–20. ACM 2015.

  • [60] A. Lerner A. K. Simpson T. Kohno and F. Roesner. Internet Jones and the Raiders of the Lost Trackers: An Archaeological Study of Web Tracking from 1996 to 2016. In USENIX Security Symposium 2016.

  • [61] M. Lesk. Micropayments: An idea whose time has passed twice? IEEE Security Privacy 2(1):61–63 2004.

  • [62] T.-C. Lin J. S.-C. Hsu and H.-C. Chen. CUSTOMER WILLINGNESS TO PAY FOR ONLINE MUSIC: THE ROLE OF FREE MENTALITY. Journal of Electronic Commerce Research 14(4) 2013.

  • [63] J. R. Mayer and J. C. Mitchell. Third-Party Web Tracking: Policy and Technology. In IEEE Symposium on Security and Privacy 2012.

  • [64] G. Merzdovnik M. Huber D. Buhov N. Nikiforakis S. Neuner M. Schmiedecker and E. Weippl. Block Me If You Can: A Large-Scale Study of Tracker-Blocking Tools. In IEEE European Symposium on Security and Privacy 2017.

  • [65] R. Molla. Next year people will spend more time online than they will watching TV. That’s a first. https://www.recode.net/2018/6/8/17441288/internet-time-spent-tv-zenith-data-media 2018.

  • [66] N. Nguyen. Latest Firefox Rolls Out Enhanced Tracking Protection. https://blog.mozilla.org/blog/2018/10/23/latest-firefox-rolls-out-enhanced-tracking-protection/ 2018.

  • [67] L. Olejnik M.-D. Tran and C. Castelluccia. Selling Off Privacy at Auction. In Proceedings of the 2014 Network and Distributed System Security Symposium. Internet Society 11 2014.

  • [68] M. Pachilakis P. Papadopoulos E. P. Markatos and N. Kourtellis. No More Chasing Waterfalls: A Measurement Study of the Header Bidding Ad-Ecosystem. arXiv:1907.12649 2019.

  • [69] P. Papadopoulos N. Kourtellis and E. Markatos. Cookie Synchronization: Everything You Always Wanted to Know But Were Afraid to Ask. In The Web Conference (WWW) 2019.

  • [70] P. Papadopoulos N. Kourtellis and E. P. Markatos. The Cost of Digital Advertisement: Comparing User and Advertiser Views. In World Wide Web Conference (WWW) 2018.

  • [71] P. Papadopoulos N. Kourtellis P. R. Rodriguez and N. Laoutaris. If You Are Not Paying for It You Are the Product: How Much Do Advertisers Pay to Reach You? In ACM Internet Measurement Conference (IMC) 2017.

  • [72] A. Razaghpanah R. Nithyanand N. Vallina-Rodriguez S. Sundaresan M. Allman and C. K. P. Gill. Apps Trackers Privacy and Regulators: A Global Study of the Mobile Tracking Ecosystem. In Network and Distributed System Security Symposium (NDSS) 2018.

  • [73] I. Reyes P. Wijesekera A. Razaghpanah J. Reardon N. Vallina-Rodriguez S. Egelman and C. Kreibich. “Is Our Children’s Apps Learning?” Automatically Detecting COPPA Violations. In IEEE Workshop on Technology and Consumer Protection (ConPro) 2017.

  • [74] A. Senior. John Hancock Leaves Traditional Life Insurance Model Behind to Incentivize Longer Healthier Lives. https://www.johnhancock.com/content/johnhancock/news/insur ance/2018/09/john-hancock-leaves-traditional-life-insurance-model-behind-to-incentivize-longer-healthier-lives.html 2018.

  • [75] N. Vallina-Rodriguez J. Shah A. Finamore Y. Grunenberger K. Papagiannaki H. Haddadi and J. Crowcroft. Breaking for commercials: characterizing mobile advertising. In ACM Internet Measurement Conference (IMC) 2012.

  • [76] N. Vallina-Rodriguez S. Sundaresan A. Razaghpanah R. Nithyanand M. Allman C. Kreibich and P. Gill. Tracking the Trackers: Towards Understanding the Mobile Advertising and Tracking Ecosystem. In Workshop on Data and Algorithmic Transparency (DAT) 2016.

  • [77] C. L. Wang Y. Zhang L. R. Ye and D.-D. Nguyen. Subscription to fee-based online services: What makes consumer pay for online content? Journal of Electronic Commerce Research 6(4):304 2005.

  • [78] J. Wilander. Intelligent Tracking Prevention 2.0. https://webkit.org/blog/8311/intelligent-tracking-prevention-2-0/ 2018.

  • [79] S. Zimmeck J. S. Li H. Kim S. M. Bellovin and T. Jebara. A Privacy Analysis of Cross-device Tracking. In USENIX Security Symposium 2017.

Search
Journal information
Metrics
All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 23 23 23
PDF Downloads 65 65 65