Artificial (Un)Intelligence: Automated Tools and Intermediary Liability

By Shivani Kabra

The exponential increase in user generated content on online platforms has produced a proliferating system of digital piracy and unlawful content. These instances raise questions of attaching secondary liability to the platforms hosting such materials. Predominantly, online platforms act as intermediaries for facilitating user interaction and user generated content. Recent legislative developments in India regarding intermediary liability require intermediaries to take proactive steps towards regulating the digital space. Intermediaries are now obligated to employ automated technology-based tools for enforcing third party rights by detecting illegal/unlawful content. As a result, the tools facilitate the creation of a parallel legal system that imposes specific socio-fiscal costs on society at large. A deeper examination of these costs is required to determine the efficacy of the legislative developments enabling their enactment.

I. Introduction

Technological developments that impact societal trends necessitate corresponding changes in law.[1] The advent and growth of social media platforms such as TikTok, YouTube, Facebook, and Instagram have dramatically increased user interactions on the internet.[2] This increase in user interactions has simultaneously created a wealth of user generated content (“UGC”) freely available for public consumption.[3] Perhaps the most accurate reflection of today’s digital culture is TikTok’s user base.[4] Reportedly, TikTok was downloaded more than 750 million times in 2019,[5] and was the most downloaded application of 2020,[6] encompassing millions of users creating publicly available UGC.

The primary concern with UGC on social media platforms is that of attributing liability for potential instances of infringement and illegality.[7] Often, UGC is created from, based upon, or influenced by pre-existing content that may already be protected under intellectual property laws.[8] To illustrate this point, one only needs to assess the genesis of the disputes concerning the Harlem Shake memes.[9] In 2012, Harry Rodriguez (DJ Baauer) had originally composed a track labelled, ‘Harlem Shake.’[10] Thereafter, multiple third-party remixes of the Harlem Shake song were uploaded on YouTube in 2013.[11] Despite the 2012 Harlem Shake song borrowing elements from pre-existing works (specifically the 1900s dance style known as “Harlem Shake”), the subsequent remixed content was disputed as infringing and accordingly monetized by the rights-holder.[12] This situation, however, becomes murkier when the question of liability arises for YouTube.[13] Since the infringing remixed UGC was uploaded on the platform, should the burden of liability for claims of digital piracy be shared indirectly with the platform?

This paper aims to contribute towards this debate by assessing the extant frameworks concerning intermediary liability within Indian and comparative jurisprudence. In doing so, the paper attempts to analyse the efficacy of applicable laws while specifically focusing on the merits of using technological tools to combat digital piracy. At the outset, it is necessary to understand the two broad categories of liability: primary and secondary liability.[14] Primary liability entails that individuals be responsible and liable for their own actions.[15] In contrast, secondary liability refers to instances of persons being responsible and liable for another’s actions.[16] Contextually, YouTube’s liability for infringing Harlem Shake remixes falls within the ambit of secondary liability, such that YouTube would be held liable for the actions (as well as the UGC) of its users.[17] Accordingly, the scope of this paper is limited to understanding intermediary liability for copyright infringement on account of UGC uploaded on intermediaries’ platforms.

II. Intermediary Liability for Copyright Infringement in India

An understanding of intermediary liability requires one to comprehend the scope and contours of the term “intermediary.”[18] Section 2(w) of the Information Technology Act, 2000 (“IT Act”) defines an intermediary as “any person who on behalf of another person, stores or transmits electronic records or provides services with respect to that record.”[19] Considering the said definition, it is reasonable to assume, prima facie, that TikTok, YouTube, Facebook and other such social media platforms involved with uploading and sharing of UGC will be considered intermediaries under Indian law.[20]

For a better understanding of the definitional ambit under Section 2(w), reliance is placed on two cases.[21] In Myspace Inc v. Super Cassettes Industries[22] (“Myspace Inc Case”), Myspace was considered to be an intermediary under Section 2(w) because it was a neutral platform that did not add, modify, or contribute any information of its own towards the UGC on its platform.[23] In contrast, in another case, the platform of Unacademy was excluded by the court from the definitional ambit of ‘intermediary’ on the grounds that (i) UGC was created using Unacademy’s software, (ii) UGC was published only after the approval of Unacademy, and (iii) Unacademy had the authority to reject, edit, modify, or change UGC.[24] Accordingly, the extensive influence and control of Unacademy over UGC made it a non-neutral platform.[25] Therefore, the determinative factor for identifying a platform as an intermediary under Indian law is the extent of editorial control exercised by the platform over the UGC it hosts.[26]

Further, the standard and extent of secondary liability for intermediaries is encompassed within Section 79 of the IT Act.[27] Section 79 provides a safe harbour clause for intermediaries thereby exempting them from (secondary) liability arising out of third party information or actions (namely, liability arising from infringing UGC), subject to compliance with certain due diligence obligations.[28]

However, the safe harbour exception is limited by the application of Sections 51 and 52 of the Copyright Act, 1957. As per Section 52(1)(c), intermediaries are not “responsible for secondary liability unless they are aware or have reasonable grounds for believing that they are storing an infringing copy [i.e. UGC].”[29] Similarly, Section 51(a)(ii) precludes secondary liability “if the person had no knowledge or reason to believe that a work was an infringement.”[30] Therefore, for an intermediary to incur secondary liability, it must have some knowledge, awareness, or reasonable belief that the content being uploaded or shared on its platform amounts to digital piracy, infringement, or unlawful content.[31] This was reaffirmed by the decision in the Myspace Inc Case, wherein an intermediary was held liable for content on its platform only when it (i) had actual or specific knowledge of the infringing content and (ii) did not take necessary steps to remove such infringing content.[32]

While the pre-requisite of ‘knowledge’ has been qualified under law as ‘reasonable and specific awareness,’ the theoretical determination of ‘necessary steps for removing digital piracy’ needs to be discussed further.[33] This topic is especially relevant in light of the recent legislative developments within the Indian space.[34] Prior to 2021, the Information Technology (Intermediaries Guidelines) Rules, 2011 excluded intermediaries from secondary liability “if they did not knowingly host, publish or transmit infringing information.”[35] This exception was applicable only in the absence of any editorial control employed by intermediaries over UGC.[36] The introduction of the draft Information Technology (Intermediaries Guideline) Amendment Rules, 2018[37] (“2018 Draft Rules”) was the first step in attempting to quantify the extent of secondary liability imposed upon intermediaries.[38] One of the most prominent obligations under the 2018 Draft Rules required intermediaries to “deploy technology based automated tools for proactively identifying and removing unlawful information or content”[39] [including infringing UGC] on their platform.[40]

Interestingly, the recent notification of the Information Technology (Intermediary Guidelines and Digital Media Ethics Code) Rules, 2021[41] (“2021 Rules”) provides further guidance on the necessary steps required from intermediaries in combating illegal UGC.[42] Similar to the 2018 Draft Rules, the 2021 Rules obligate intermediaries to observe certain standards of due diligence, in the absence of which they will not be able to claim the benefit of the safe harbour exception.[43] For instance, intermediaries are required to (i) inform users of their policies and regulations,[44] and (ii) report cyber security incidents to the Indian Computer Emergency Response Team.[45] Notably though, the 2021 Rules differ from the 2018 Draft Rules insofar as technology based automated tools are only required from significant social media intermediaries (as defined in the 2021 Rules) for detecting certain kinds of unlawful UGC (as defined in the 2021 Rules), which do not include infringing UGC.[46]

However, regardless of the provisions of the 2021 Rules, it is interesting to note that intermediaries in India have already commenced using technology based automated tools for detecting infringing UGC in order to offset their liability under the IT Act and the Copyright Act, 1957.[47] Thus, while further developments are awaited on this subject, it becomes important to understand the functioning and implications of these tools for Indian jurisprudence and industries, in order to better mould Indian policies.[48] So far as the economic costs are concerned, building and deploying such tools necessitates complete business overhauls and immense investments.[49] For example, Google has expended upwards of USD 100 million in building and implementing its automated enforcement tool, ‘Content ID.’[50] Yet, the more pressing concern relates to the potential social costs of these tools: fiscal costs may vary across companies, but the social costs will remain largely standard due to the common components[51] that these tools share.

III. Technology-Based Automated Enforcement Tools

To date, automated enforcement tools in mainstream prominence have been largely governed by the laws of the United States.[52] The overarching law regarding intermediary liability in the United States has been laid down in the Digital Millennium Copyright Act, 1998 (“DMCA”).[53] Section 512(c) of the DMCA attaches secondary liability for infringing UGC against intermediaries “if they are aware of the presence of the infringing material or upon obtaining such knowledge, do not act expeditiously to remove or disable such infringing content.”[54] Notably, while the DMCA requires intermediaries to take steps towards removing infringing UGC upon obtaining knowledge of the same, it does not require the employment of automated enforcement tools.[55] Nevertheless, despite the clear absence of an obligation to this effect, certain platforms, such as YouTube and Facebook, have been employing automated enforcement tools for effective identification of illegal (and infringing) UGC.[56]

1. Technology-Based Automated Tools Employed by YouTube and Facebook

The primary aim of Content ID is to identify infringing UGC uploaded on YouTube, in order to disclaim YouTube’s secondary liability for such infringing content.[57] Content ID has been built with the objective of aiding content owners in detecting infringing copies of their work on YouTube.[58] For the tool to work, content owners have to submit their copyright protected work to YouTube.[59] Thereafter, Content ID creates ‘fingerprints’ of the submitted works in its database.[60] These fingerprints are compared against works uploaded on YouTube to determine whether an upload is a match, copy, or imitation of the original work submitted to the YouTube database.[61] If such a match is detected, the copyright owner has the option to block, monitor, or monetize[62] the infringing UGC.[63] Subsequently, the alleged infringer too has the option of disputing the claim of infringement within the limits of Content ID.[64]
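
The workflow described above (submit a reference work, fingerprint it, compare uploads, apply the owner's chosen option) can be illustrated with a deliberately simplified sketch. It is not YouTube's implementation, which is not public: the function names, the window size, the 0.5 threshold, and the use of exact hashes over a numeric sequence are all assumptions made for illustration, and real systems are understood to rely on perceptual fingerprints that tolerate re-encoding and editing.

```python
import hashlib


def fingerprint(samples, window=4):
    """Hash every overlapping window of a sample sequence into a set.

    Hypothetical stand-in for a perceptual fingerprint: exact hashing only
    matches identical runs of samples.
    """
    return {
        hashlib.sha256(repr(tuple(samples[i:i + window])).encode()).hexdigest()
        for i in range(len(samples) - window + 1)
    }


def match_ratio(reference, upload, window=4):
    """Share of the upload's windows that also occur in the reference work."""
    ref_fp = fingerprint(reference, window)
    up_fp = fingerprint(upload, window)
    if not up_fp:
        return 0.0
    return len(ref_fp & up_fp) / len(up_fp)


def claim_action(reference, upload, policy="monetize", threshold=0.5):
    """Return the owner's chosen policy if the overlap meets the threshold.

    The threshold is an assumed tuning value, not a legal standard. Nothing
    here asks why the material was reused.
    """
    if match_ratio(reference, upload) >= threshold:
        return policy
    return None


reference_work = list(range(20))      # the work submitted by the owner
verbatim_clip = list(range(5, 15))    # an upload copying a stretch of it
print(claim_action(reference_work, verbatim_clip, policy="block"))
```

In this sketch the decision turns only on a numeric overlap score, so a parody or a review quoting the same stretch would be treated exactly like a verbatim copy.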

Similar to YouTube, Facebook too uses an automated technology based tool, known as ‘Rights Manager.’[65] Rights Manager works by detecting matching audio and video content based on rules and conditions (“Match Rules”) set by the rights holder.[66] The rights holder (subscriber of the tool/copyright owner) uploads reference files and indicates whether they own the rights to the video, audio, or both.[67] Thereafter, Rights Manager finds matches and applies Match Rules such as blocking, monitoring, or monetizing the infringing content.[68] Similar to Content ID, Rights Manager also allows the alleged infringer to dispute claims as per the procedure established within Rights Manager.[69]
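
The interaction between a reference file, the rights it claims, and a Match Rule can be sketched in the same spirit. The class and field names below are hypothetical and do not reflect Facebook's actual interface; in particular, the minimum-overlap condition is an assumed example of the kind of condition a rights holder might set.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReferenceFile:
    title: str
    owns_audio: bool   # rights holder claims the audio track
    owns_video: bool   # rights holder claims the video track


@dataclass(frozen=True)
class MatchRule:
    action: str                        # "block", "monitor", or "monetize"
    min_overlap_seconds: float = 0.0   # hypothetical condition


@dataclass(frozen=True)
class DetectedMatch:
    audio_overlap_seconds: float
    video_overlap_seconds: float


def apply_match_rule(ref, rule, match):
    """Return the rule's action if a claimed track overlaps enough, else None."""
    claimed = 0.0
    if ref.owns_audio:
        claimed = max(claimed, match.audio_overlap_seconds)
    if ref.owns_video:
        claimed = max(claimed, match.video_overlap_seconds)
    if claimed > 0 and claimed >= rule.min_overlap_seconds:
        return rule.action
    return None


song = ReferenceFile("reference track", owns_audio=True, owns_video=False)
rule = MatchRule(action="monetize", min_overlap_seconds=10.0)
print(apply_match_rule(song, rule, DetectedMatch(12.0, 0.0)))
```

The outcome is fixed in advance by the rights holder's rule; the uploader's only recourse in such a design is whatever dispute step the tool itself provides.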

An analysis of both tools highlights the following key common features: (i) they act as an ‘upload filter’ for UGC uploaded on their respective platforms,[70] (ii) they involve automatic filtering that precludes manual or human review of ‘detected matches’ at the first instance,[71] (iii) they allow the copyright holder/rights holder to claim ad earnings from the allegedly infringing UGC through the option of monetization,[72] (iv) they have an internal dispute resolution system for claims made via the tools,[73] and (v) they are universally applicable across territorial borders.[74] As a result, the tools create a distinct legal environment with specific social costs that are discussed in the next section of this paper.[75]

2. Assessing the Social Cost of Automated Tools: Conduits of Control Mechanism

Considering the key features of the tools conceptualized by YouTube and Facebook (and by default Instagram),[76] it is easy to recognize the ability of automated tools to function as an independent control mechanism for copyright infringement on their respective platforms. To that end, this paper analyses automated tools from a threefold socio-legal perspective: (i) the limited analytical capacity of algorithms, (ii) the universality of the tools, and (iii) the birth of a parallel dispute resolution system.[77]

First, the automated tools do not consider legal exceptions to copyright infringement such as the fair use doctrine in the United States or any of the exceptions under Section 52 of the Copyright Act.[78] Instead, the rights holders are required to determine the applicability of such exceptions prior to filing a claim for infringement.[79] The primary reason for this omission is the diminished ability of automated content identification systems to distinguish fair use or statutorily exempted use from actual infringement.[80] In legal frameworks, interpretations of such exceptions to infringement are largely subjective in nature and left to the discretion of the judiciary on a case to case basis.[81] The contextual and dynamic understanding of the exceptions thus places them beyond a machine’s or an algorithm’s ability of discernment.[82]

Second, the universal application of automated tools across territorial borders fails to consider lex loci copyright laws and interpretations, i.e. laws of the country in which the transaction is performed.[83] Disregarding lex loci laws in favour of the interpretations posited by the automated tools occurs on two grounds: (i) standard of copyright protection and (ii) standard of infringement.[84]

Copyright is a statutory right bound within the contours of each country’s legislation.[85] Accordingly, the standard of copyright protection provided for a certain work varies across countries.[86] For illustration, United States law requires all forms of protectible work to be fixed in a stable and permanent medium[87] while such a condition of fixation is only expressly present within Indian law for “dramatic works.”[88] In contrast, automated tools operate under the presumption that all uploaded content, such as videos or sound recordings, is protected under copyright law.[89] This attribution of protection for certain works by the automated tools happens in the absence of any verification regarding the lex loci understanding of subject matter categories of works or subject matter eligibility/qualifications.[90] For illustration, the definitional understanding of ‘dramatic works’ under US law requires such works to convey ‘a story or theme through a series of dramatic situations’[91] whereas no such limitation is imposed on the understanding of ‘dramatic works’ under Indian law.[92] Further, Indian law specifically disavows copyright protection for ad libitum works,[93] while the automated enforcement system presumes protection for such works on its platforms.[94]

In addition, the standard of infringement under varied laws is ordinarily understood to be that of substantial similarity,[95] subject only to certain exceptions such as the ‘de minimis’ rule.[96] The de minimis rule qualitatively and quantitatively assesses the size and extent of copying and excludes insubstantial copying from the ambit of ‘infringement.’[97] Contrarily, automated tools are not equipped to conduct a qualitative or quantitative assessment of the infringing content against the entire uploaded content as per the varying requirements of lex loci laws.[98] By disregarding whether a work is capable of being protected under the subject matter categories and qualifications of lex loci, or if an instance of copying sufficiently satisfies the understanding of infringement under lex loci, automated tools put forward a harmonized and alternative understanding of copyright.[99]

Third, the tools create an internal and distinctly separate system of dispute resolution while providing rights holders with a definitive set of remedies in the form of blocking, monitoring, or monetizing the infringing UGC.[100] Selection of remedies for instances of detected infringement is left to the complete discretion of the rights holders.[101] This discretion in turn facilitates differential treatment for similar instances of infringement.[102] Further, the list of remedies and steps for dispute resolution act as a comprehensive alternative to the traditional legal system under lex loci.[103] Besides the predetermined methods, rights holders or alleged infringers are not allowed to seek other forms of resolution through the automated tools.[104]

On the basis of the above, it is recognized that automated tools not only reflect the intermediary’s perception of a controlled setting for identifying and resolving infringement disputes but also promote the conception of a copyright system parallel to that of lex loci.[105] Customarily, the key components of a legal system provide for its defined scope and purpose, instances of contravention, and resolution mechanisms.[106] These key components are satisfied under automated enforcement systems through their independent and separate procedures for submitting infringement claims and appeals, along with the standardized understanding of their subject matter and scope, i.e. ‘protected works’ and ‘infringement’.[107]

The resultant effect is that intermediaries act not only as law makers but also as adjudicators of disputes.[108] Accordingly, the introduction of automated tools within the Indian legal framework facilitates possibilities of a parallel legal system that overlooks Indian laws in support of institutionalized interpretations and understandings put forward by corporations (i.e., intermediaries).[109] Interestingly, the Indian courts in Shreya Singhal v. Union of India had previously limited the rights of intermediaries to remove UGC in the absence of a court order or notification.[110] In furtherance of this, courts have also held that the IT rules should not be deemed to “vest in intermediary(ies) suo motu powers to detect and refuse hosting of infringing contents.”[111] The rationale for the same is grounded in preventing the implicit sanctioning of intermediaries as assessors and adjudicators for instances of alleged contraventions.[112]

IV. Conclusion

The adoption of technology based automated tools by intermediaries creates a paradigm shift in identifying unlawful UGC.[113] Traditionally, the impetus for resolving disputes has vested with the judiciary and the legislature.[114] However, with the inception of automated tools, the impetus has now shifted to allow adjudicatory-esque powers to the intermediaries.[115]

The current developments in India accurately reflect the shift in paradigm from the traditional understanding of dispute resolution towards the newer, evolving approach of automated enforcement tools.[116] In pursuance of this, the paper has dwelt upon the socio-legal consequences of implementing such tools.[117] The 2021 Rules create a unique fork in the road for the evolution of secondary liability for intermediaries within Indian jurisprudence.[118] With multiple intermediaries across varied industries engaging technology-based automated tools, the creation of a parallel legal system has become more than a tentative possibility,[119] limiting user ability, creativity, and interaction within the said parallel, predefined, and predetermined processes of enforcement tools.[120] It is now more than pertinent for Indian and foreign jurisprudences alike to take into consideration the socio-legal consequences associated with automated tools (as highlighted in this paper), prior to policy enactments and sanctioning of such tools.[121]

