Viscount Colville of Culross - All Data (Use and Access) Act 2025 Contributions

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

2nd reading
Tuesday 19th November 2024

(1 year, 8 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

I, too, thank the Minister for her introduction to this welcome Bill. I feel that most noble Lords have an encyclopaedic knowledge of this subject, having been around the course not just once but several times. As a newcomer to this Bill, I am aware that I have plenty to learn from their experience. I would like to thank the Ada Lovelace Institute, the Public Law Project, Connected by Data and the Open Data Institute, among others, which have helped me get to grips with this complicated Bill.

Data is the oil of the 21st century. It is the commodity which drives our great tech companies and the material on which the large language models of AI are trained. We are seeing an exponential growth in the training and deployment of AI models. As many noble Lords have said, it has never been more important than now to protect personal data from being ruthlessly exploited by these companies, often without the approval of either the data owners or the creators. It is also important that, as we roll out algorithmic use of data, we ensure adequate protections for people’s data. I, too, hope this Bill will soon be followed by another regulating the development of AI.

I would like to draw noble Lords’ attention to a few areas of the Bill which cause me concern. During the debates over the last data protection Bill, I know there were worries over the weakening of data subjects’ protection and the loosening of processing of their data. The Government must be praised for losing many of these clauses, but I am concerned, like some other noble Lords, to ensure adequate safeguards for the new “recognised legitimate interests” power given to data processors. I support the Government’s growth agenda and understand that this power will create less friction for companies when using data for their businesses, but I hope that we will have time later in the passage of the Bill to scrutinise the exemption from the three tests for processing data, particularly the balancing test, which are so important in forcing companies to consider the data rights of individuals. This is especially so when safeguarding children and vulnerable people. The test must not be dropped at the cost of the rights of people whose data is being used.

This concern is reinforced by the ICO stating in its guidance that this test is valuable in ensuring companies do not use data in a way that data subjects would not reasonably expect it to be used. It would be useful in the Explanatory Notes to the Bill to state explicitly that when a data processor uses “recognised legitimate interests”, their assessment includes the consideration of proportionality of the processing activity. Does the Minister agree with this suggestion?

The list of four areas for this exemption has been carefully thought through, and I am glad that the category of democratic engagement has been removed. However, the clause does give future Ministers a Henry VIII power to extend the list. I am worried; I have heard some noble Lords say that they are as well, and that the clause’s inclusion in the previous Bill also concerned other noble Lords. It could allow future Ministers to succumb to commercial interests and add new categories, which might be to the cost of data subjects. The Minister, when debating this power in the previous data Bill, reminded the House that the Delegated Powers and Regulatory Reform Committee said of these changes:

“The grounds for lawful processing of personal data go to the heart of the data processing legislation and therefore in our view should not be capable of being changed by subordinate legislation”.

The Constitution Committee’s report called for the Secretary of State’s powers in this area to be subject to primary and not secondary legislation. Why do these concerns not apply to Clause 70 in this Bill?

I welcome the Government responding to the scientific community’s demand that they should be able to reuse data for scientific, historic or statistical research. There will be many occasions when data was collected for the study of a specific disease and the researchers want to reuse it years later for further study, but they have been restricted by the narrow distinctions between the original and the new purpose. The Government have incorporated recitals from the original GDPR in the Bill, but the changes in Clause 67 must be read against the developments taking place in AI and the way in which it is being deployed.

I understand that the Government have gone to great efforts to set out a clear definition of scientific research in this clause. One criterion is the

“processing for the purposes of technological development or demonstration … so far as those activities can reasonably be described as scientific”,

and another is the publication of scientific papers from the study. But my fear is that AI companies, in their urgent need to scrape datasets for training large language models, will go beyond the policy intention in this clause. They might posit that their endeavours are scientific and may even be supported by academic papers, but when this is combined with the inclusion of commercial activities in the Bill, it opens the way for data reuses in creating AI data-driven products which claim they are for scientific research. The line between product development and scientific research is blurred because of how little is understood about these emerging technologies. Maybe it would help if the Bill set out what areas of commercial activity should not be considered scientific research. Can the Minister share with the House how the clause will stop attempts by AI developers to claim they are doing scientific research when they are reusing data to increase model efficiency and capabilities, or studying their risks? They might even be producing scientific papers in the process.

I have attended a forum with scientists and policymakers from tech companies using the training data for AI who admitted that it is sometimes difficult to define the meaning of scientific research in this context. This concern is compounded by Clause 77, which provides an exemption to Article 13 of the UK GDPR for researchers and archivists to provide additional information to a data subject when reusing their data for different purposes if it requires disproportionate effort to obtain the required information. I understand these provisions are drawn to help reuse medical data, but they could also be used by AI developers to say that contacting people for the reuse of datasets from an already trained AI model requires disproportionate effort. I understand there are caveats around this exemption. However, in an era when AI companies are scraping millions of pieces of data to train their models, noble Lords need to bear in mind it is often difficult for them to get permission from the data subjects before reusing the information for AI purposes.

I am impressed by the safeguards for the exemption for medical research set out in Clause 85. The clause says that medical research should be supervised by a research ethics committee to assess the ethical reuse of the data. Maybe the Government should think about using some kind of independent research committee with standards set by UKRI before commercial researchers are allowed to reuse data.

Like many other noble Lords, I am concerned about the changes to Article 22 of the UK GDPR put forward in Clause 80. I quite understand why the Government want to expand solely automated decision-making in order for decisions to be made quickly and efficiently. However, these changes need to be carefully scrutinised. The clause removes the burden on the data controller to overcome tests before implementing ADM, outside of the use of sensitive information. The new position requires the data subject to proactively ask if they would like a human to be involved in the decision made about them. Surely the original Article 22 was correct in making the processor think hard before making a decision to use ADM, rather than putting the burden on the data subject. That must be the right way round.

There are other examples, which do not include sensitive data, where ADM decisions have been problematic. Noble Lords will know that, during Covid, algorithms were used to predict A-level results which, in many cases, were flawed. None of that information would have been classified as sensitive, yet the decisions made were wrong in too many cases.

Once again, I am concerned about the Henry VIII powers which have been granted to the Secretary of State in new Article 22D(1) and (2). This clause is already extending the use of ADM, but it gives Secretaries of State in the future the power to change by regulation the definition of “meaningful human involvement”. This potentially allows for an expansion of the use of ADM; they could water down the effectiveness of human involvement needed to be considered meaningful.

Likewise, I am worried by the potential for regulations to be used to change the definition of a decision having a “significant adverse effect” on a data subject. The risk is that this could be used to exclude them from the relevant protection, but the decision could nevertheless still have a significant harmful effect on the individual. An example would be if the Secretary of State decided to exclude from the scope of a “significant decision” interim, rather than final, decisions. This could result in the exclusion of a decision taken entirely on the basis of a machine learning predictive tool, without human involvement, to suspend somebody’s universal credit pending an investigation and final decision of whether fraud had actually been committed. Surely some of the anxiety about this potential extension of ADMs would be assuaged by increased transparency around how they are used. The Bill is a chance for the Government to give greater transparency to how ADMs process our information. The result would be to greatly increase public trust.

The Algorithmic Transparency Recording Standard delivers greater understanding about the nature of tools being used in the public sector. However, of the 55 ADM tools in operation, only 9 reports have currently been subject to the ATRS. In contrast, the Public Law Project’s Tracking Automated Government register has identified at least 55 additional tools, with many others still to be uncovered. I suggest that the Government make it mandatory for public bodies to publish information about the ADM systems that they are using on the ATRS hub.

Just as importantly, this is a chance for people to obtain personal information about how an automated decision is made. The result would be that, if somebody is subject to a decision made or supported by AI or an algorithmic tool, they should be notified at the time of the decision and provided with a personalised explanation of how and why it was reached.

Finally, I will look at the new digital verification services trust framework being set up in Part 2. The Government must be praised for setting up digital IDs, which will be so useful in the online world. My life, and I am sure that of many others, is plagued by the vagaries of getting access to the various websites we need to run our lives, and I include the secondary security on our phones, which so often does not work. The effectiveness of this ID will depend on the trust framework that is created and on who is involved in building it.

At the moment, in Clause 28, the Secretary of State must consult the Information Commissioner and such other persons as the Secretary of State sees appropriate. It seems to me that the DVS will be useful only if it can be used across national boundaries. Interoperability must be crucial in a digital world without frontiers. I suggest that an international standards body should be included in the Bill. The most obvious would be W3C, the World Wide Web Consortium, which is the standards body for web technology. It was founded by Sir Tim Berners-Lee and is already responsible for the development of a range of web standards, from HTML to CSS. More than that, it is used in the beta version of the UK digital identity and attributes trust framework and has played a role in both the EU and the Australian digital identity services frameworks. I know that the Government want the Secretary of State to have flexibility in drawing up this framework, but the inclusion of an international standards body in the Bill would ensure that the Minister has them in the forefront of their mind when drawing up this much-needed framework.

The Bill is a wonderful opportunity for our country to build public trust in data-driven businesses and their development. It is a huge improvement on its predecessor; it goes a long way to ensure that the law has protections for data subjects and sets out how companies can lawfully use and reuse data. It is just as crucial in the era of AI that, during the passage of the Bill through the House, we do not leave the door open for personal data to be ruthlessly exploited by the big tech companies. We would all be damaged if that was allowed to happen.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Committee stage
Tuesday 3rd December 2024

(1 year, 7 months ago)

Grand Committee

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Read Debate Ministerial Extracts Amendment Paper: HL Bill 40-I(a) Amendments for Grand Committee (Supplementary to the Marshalled List) - (3 Dec 2024)

Lord Clement-Jones (LD)

- Hansard - - - Excerpts

My Lords, I almost have a full house in this group, apart from Amendment 35, so I will not read out the numbers of all the amendments in this group. I should just say that I very much support what the noble Viscount, Lord Colville, has put forward in his Amendment 35.

Many noble Lords will have read the ninth report of the Delegated Powers and Regulatory Reform Committee. I am sad to say that it holds exactly the same view about this Bill as it did about the previous Bill’s provisions regarding digital verification services. It said that

“we remain of the view that the power conferred by clause 28 should be subject to parliamentary scrutiny, with the affirmative procedure providing the appropriate level of scrutiny”.

It is against that backdrop that I put forward a number of these amendments. I am concerned that, although the Secretary of State is made responsible for this framework, in reality, they cannot be accountable for delivering effective governance in any meaningful way. I have tried, through these amendments, to introduce at least some form of appropriate governance.

Of course, these digital verification provisions are long-awaited—the Age Verification Providers Association is pleased to see them introduced—but we need much greater clarity. How is the Home Office compliant with Part 2 of the Bill as it is currently written? How will these digital verification services be managed by DSIT? How will they interoperate with the digital identity verification services being offered by DSIT in the UK Government’s One Login programme?

Governance, accountability and effective, independent regulation are also missing. There is no mechanism for monitoring compliance, investigating malicious actors or taking enforcement action regarding these services. The Bill has no mechanism for ongoing monitoring or the investigation of compliance failures. The Government propose to rely on periodic certification being sufficient but I understand that, when pressed, DSIT officials say that they are talking to certification bodies and regulators about how they can do so. This is not really sufficient. I very much share the intention of both this Government and the previous one to create a market in digital verification services, but the many good players in this marketplace believe that high levels of trust in the sector depend on a high level of assurance and focus from the governance point of view. That is missing in this part of the Bill.

Amendment 33 recognises the fact that the Bill has no mechanism for ongoing monitoring or the investigation of compliance failures. As we have seen from the Grenfell public inquiry, a failure of governance caused by not proactively monitoring, checking and challenging compliance has real, harmful consequences. Digital verification services rely on the trustworthiness of the governance model; what is proposed is not trustworthy but creates material risk for UK citizens and parties who rely on the system.

There are perfectly decent examples of regulatory frameworks. PhonepayPlus provides one such example, with a panel of three experts supported by a secretariat; the panel can meet once a quarter to give its opinion. That has been dismissed as being too expensive, but I do not believe that any costings have been produced or that it has been considered how such a cost would weigh against the consequences of a failure in governance of the kind identified in recent public inquiries.

Again, as regards Amendment 36, there is no mechanism in the Bill whereby accountability is clearly established in a meaningful way. Accountability is critical if relying parties and end-users are to have confidence that their interests are safeguarded.

Amendment 38 is linked to Amendment 36. The review under Clause 31 must be meaningful in improving accountability and effective governance. The amendment proposes that the review must include performance, specifically against the five-year strategy and of the compliance, monitoring and investigating mechanisms. We would also like to see the Secretary of State held accountable by the Science and Technology Select Committee for the performance captured in the review.

On Amendment 41, the Bill is silent on how the Secretary of State will determine that there is a compliance failure. It is critical to have some independence and professional rigour included here; the independent appeals process is really crucial.

As regards Amendments 42 and 43, recent public inquiries serve to illustrate the importance of effective governance. Good practice for effective governance would require the involvement of an independent body in the determination of compliance decisions. There does not appear to be an investigatory resource or expertise within DSIT, and the Bill currently fails to include requirements for investigatory processes or appeals. In effect, there is no check on the authority of the Secretary of State in that context, as well as no requirement for the Secretary of State proactively to monitor and challenge stakeholders on compliance.

As regards Amendment 44, there needs to be a process or procedure for that; fairness requires that there should be a due process of investigation, a review of evidence and a right of appeal to an independent body.

I turn to Amendment 45 on effective governance. A decision by the appeals body that a compliance failure is so severe that removal from the register is a proportionate measure must be binding on the Secretary of State, otherwise there is a risk of lobbying and investment in compliance and service improvement being relegated below that of investment in lobbying. Malicious actors view weaknesses in enforcement as a green light and so adopt behaviours that both put at risk the safety and security of UK citizens and undermine the potential of trustworthy digital verification to drive economic growth.

Amendment 39 would exclude powers in this part being used by government as part of GOV.UK’s One Login.

I come on to something rather different in Amendment 46, which is very much supported by Big Brother Watch, the Digital Poverty Alliance and Age UK. Its theme was raised at Second Reading. A significant proportion of the UK’s population lacks internet access, with this issue disproportionately affecting older adults, children and those from low-income backgrounds. This form of digital exclusion presents challenges in an increasingly digital world, particularly concerning identity verification.

Although digital identity verification can be beneficial, it poses difficulty for individuals who cannot or choose not to engage digitally. Mandating online identity verification can create barriers for digitally excluded groups. For example, the National Audit Office found that only 20% of universal credit applicants could verify their identity online, highlighting concerns for those with limited digital skills. The Lords Communications and Digital Select Committee emphasised the need for accessible, offline alternatives to ensure inclusivity in a connected world. The proponents of this amendment advocate the availability of offline options for essential public and private services, particularly those requiring identity verification. This is crucial as forcing digital engagement can negatively impact the well-being and societal participation of older people.

This is the first time that I have prayed in aid what the Minister said during the passage of the Data Protection and Digital Information Bill; this could be the first of a few such occasions. When we debated the DPDI Bill, she stressed the importance of a legal right to choose between digital and non-digital identity verification methods. I entirely agreed with her at the time. She said that this right is vital for individual liberty, equality and building trust in digital identity systems and that, ultimately, such systems should empower individuals with choices rather than enforce digital compliance. That is a fair summary of what she said at the time.

I turn to Amendment 50. In the context of Clause 45 and the power of public authorities to disclose information, some of which may be the most sensitive information, it is important for the Secretary of State to be able to require the public authority to provide information on what data is being disclosed and where the data is going, as well as why the data is going there. This amendment will ensure that data is being disclosed for the right reasons, to the right places and in the right proportion. I beg to move.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, I tabled Amendment 35 because I want to make the DVS trust framework as useful as possible. I support Amendment 33 in the name of the noble Lord, Lord Clement-Jones, and Amendment 37 in the name of the noble Viscount, Lord Camrose.

The framework’s mandate is to define a set of rules and standards designed to establish trust in digital identity products in the UK. It is what I would hope for as a provision in this Bill. As the Minister told us at Second Reading, the establishment of digital ID services with a trust mark will increase faith in the digital market and reduce physical checks—not to mention reducing the time spent on a range of activities, from hiring new workers to moving house. I and many other noble Lords surely welcome the consequent reduction in red tape, which so often impedes the effectiveness of our public services.

Clause 28(3) asks the Secretary of State to consult the Information Commissioner and such persons as they consider appropriate. However, in order to ensure that these digital ID services are used and recognised as widely as possible—and, more importantly, that they can be used by organisations beyond our borders— I suggest Amendment 35, which would include putting consultation with an international digital standards body in the Bill. This amendment is supported by the Open Data Institute.

I am sure that the Minister will tell me that that amendment is unnecessary as we can leave it to the common sense of Ministers and civil servants in DSIT to consult such a body but, in my view, it is helpful to remind them that Parliament thinks the consultation of an international standards body is important. The international acceptance of DVS is crucial to its success. Just like an email, somebody’s digital identity should not be tied to a company or a sector. Imagine how frustrating it would be if we could only get Gmail in the UK and Outlook in the EU. Imagine if, in a world of national borders and jurisdictions, you could not send emails between the UK and the EU as a result. Although the DVS will work brilliantly to break down digital identity barriers in the UK, there is a risk that no international standards body might be consulted in the development of the DVS scheme. This amendment would be a reminder to the Secretary of State that there must be collaboration between this country, the EU and other nations, such as Commonwealth countries, that are in the process of developing similar schemes.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Committee stage
Tuesday 10th December 2024

(1 year, 7 months ago)

Grand Committee

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Read Debate Ministerial Extracts Amendment Paper: HL Bill 40-II(a) Amendment for Grand Committee (Supplementary to the Second Marshalled List) - (9 Dec 2024)

Moved by

Viscount Colville of Culross

- Hansard - -

59: Clause 67, page 75, line 9, after “processing” insert “solely”.
Member’s explanatory statement
This amendment prevents misuse of the scientific research exceptions for data reuse by ensuring that the only purpose for which the reuse is permissible is for the scientific research—with no additional purposes.

Viscount Colville of Culross (CB)

- Hansard - -

I have tabled Amendments 59, 62, 63 and 65, and I thank the noble Lord, Lord Clement-Jones, my noble friend Lady Kidron and the noble Viscount, Lord Camrose, for adding their names to them. I am sure that the Committee will agree that these amendments have some pretty heavyweight support. I also support Amendment 64, in the name of the noble Lord, Lord Clement-Jones, which is an alternative to my Amendment 63. Amendments 68 and 69 in this group also warrant attention.

I very much support the Government’s aim in Clause 67 to ensure that valuable research does not get discarded due to a lack of clarity around its use or because of an overly narrow distinction between the original and new purposes of the use of the data. The Government’s position is that this clause clarifies the law by incorporating into the Bill recitals to the original GDPR. However, while the effect is to encourage scientific research and development, it has to be seen in the context of the fast-evolving world of developments in AI and the way that AI developers, given the need for huge amounts of data to train their large language models, are reusing data.

My concern is that the scraping of vast amounts of data by these AI companies is often positioned as scientific research and in some cases is even supported by the production of academic papers. I ask the Minister to understand my concerns and those of many in the data community and beyond. The fact is that the lines between scientific research, as set out in Clause 67, and AI product development are blurred. This might not be the concern of the original recitals, but I beg to suggest to the Minister that, in the new world of AI, there should be concern about the definition presented in the Bill.

Like other noble Lords, I very much hope to make this country a centre of AI development, but I do not want this to happen at the expense of data subjects’ privacy and data protection. It costs at least £1 billion—even more, sometimes—to develop a large language model and, although the cost will soon go down, there is a huge financial incentive to scrape data that pushes the boundaries of what is legitimate. In this climate, it is important that the Bill closes any loopholes that allow AI developers to claim the protections offered by Clause 67. My Amendments 59, 62, 63 and 65 go some way to ensuring that this will not happen.

The definition of scientific research in proposed new paragraph 2, in Clause 67(1)(b), is drawn broadly. My concern is that many commercial developments of digital products, particularly those involving AI, could still claim to be, in the words of the clause, “reasonably … described as scientific”. AI model development usually involves a mix of purposes—not just developing its capabilities but also commercialising as it develops services. The exemption allowed for “purposes of technological development” makes me concerned that this vague area creates a threat whereby AI developers will misuse the provisions of the Bill to reuse personal data for any AI developments, provided that one of their goals is technological advancement.

Amendments 59 and 62, by inserting the word “solely” into proposed new paragraphs 2 and 3 in Clause 67, would disaggregate reuse of data for scientific research purposes from other purposes, ensuring that the only goal of reuse is scientific research.

An example of the threat under the present definition is shown by Meta’s recently allowing the reuse of Instagram users’ data to train its new generation of Llama models. When the news got out, it created a huge backlash, with more than half a million people reposting a viral hoax image that claimed to deny Meta the right to reuse their data to train AI. This caused the ICO to say that it was pleased that Meta had paused its data processing in response to users’ concerns, adding:

“It is crucial that the public can trust that their privacy rights will be respected from the outset”.

However, Meta could well claim under this clause that it is creating technological advancement which would allow it to reuse any data collected by users under the legitimate interest grounds for training the model. The Bill as it stands would not require the company to conduct its research in accordance with any of the features of genuine scientific research. These amendments go some way to rectify that.

Amendment 63 increases the test for what is deemed to be scientific interest. At the moment, the public interest test is applied only to public health. I am pleased that NHS researchers will have to recognise this threshold, but why should all researchers doing scientific work not have to adhere to this threshold? Why should that test not be applied to all data reuse for scientific research? By deleting the public health exception, the public interest test would apply to all data reuse for scientific purposes.

The original intention of the RAS purpose of the GDPR supports public health for scientific interests. This is complemented by Amendment 65, which uses the tests for consent already laid out in Clause 68. The inclusion of ethical thresholds in the reuse of data should meet the highest levels of academic rigour and oversight envisaged in the original GDPR. It will demand not just ethical standards in research but for it to be supervised by an independent research ethics committee that meets UKRI guidance. These requirements will ensure that the high standards of ethics that we expect from scientific research will be applied in evaluating the exemption in Clause 67.

I do not want noble Lords to think that these amendments are thwarting the development of AI. There is plenty of AI research that is clearly scientific. Look at DeepMind AlphaFold, which uses AI to analyse the shape of proteins so that they can be incorporated in future drug treatment and will move pharmaceutical development. It is an AI model developed in accordance with the ethical standards expected from modern scientific research.

The Minister will argue that the definition has been taken straight from EU recitals. I therefore ask her to consider very seriously what has been said about this definition by the EU’s premier data body, the European Data Protection Supervisor, in its preliminary opinion on data protection and scientific research. In its executive summary, it states:

“The boundary between private sector research and traditional academic research is blurrier than ever, and it is ever harder to distinguish research with generalisable benefits for society from that which primarily serves private interests. Corporate secrecy, particularly in the tech sector, which controls the most valuable data for understanding the impact of digitisation and specific phenomena like the dissimilation of misinformation, is a major barrier to social science research … there have been few guidelines or comprehensive studies on the application of data protection rules to research”.

It suggests that the rules should be interpreted in such a way that permits reuse only for genuine scientific research.

For the purpose of this preliminary opinion by the EDPS, the special data protection regime for scientific research is understood to apply if each of three criteria are met: first, personal data is processed; secondly, relevant sectorial standards of methodology and ethics apply, including the notion of informed consent, accountability and oversight; and, thirdly, the research is carried out with the aim of growing society’s collective knowledge and well-being as opposed to serving primarily one or several private interests. I hope that noble Lords will recognise that these are features that the amendments before the Committee would incorporate into Clause 67.

In the circumstances, I hope that the Minister, who I know has thought deeply about these issues, will recognise that the EU’s institutions are worried about the definition of scientific research that has been incorporated into the Bill. If they are worried, I suggest that we should be worried. I hope that these amendments will allay those fears and ensure that true scientific research is encouraged by Clause 67 and that it is not abused by AI companies. I beg to move.

--- Later in debate ---

I know that we have gone around this subject in a very wide sense and that we might equally revisit some of these issues on other amendments. But I hope that, for the moment, I have reassured noble Lords on the specific details of their amendments and persuaded them that those strong protections are in place.

Viscount Colville of Culross (CB)

- Hansard - -

I thank the Minister very much, but is she not concerned by the preliminary opinion from the EDPS, particularly that traditional academic research is blurrier than ever and that it is even harder to distinguish research which generally benefits society from that which primarily serves private interest? People in the street would be worried about that and the Bill ought to be responding to that concern.

Baroness Jones of Whitchurch (Lab)

- Hansard - - - Excerpts

I have not seen that observation, but we will look at it. It goes back to my point that the provisions in this Bill are designed to be future facing as well as for the current day. The strength of those provisions will apply regardless of the technology, which may well include AI. Noble Lords may know that we will bring forward a separate piece of legislation on AI, when we will be able to debate this in more detail.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, this has been a very important debate about one of the most controversial areas of this Bill. My amendments are supported across the House and by respected civic institutions such as the Ada Lovelace Institute. I understand that the Minister thinks they will stifle scientific research, particularly by nascent AI companies, but the rights of the data subject must be borne in mind. As it stands, under Clause 67, millions of data subjects could find their information mined by AI companies, to be reused without consent.

The concerns about this definition being too broad were illustrated very well across the Committee. The noble Lord, Lord Clement-Jones, said that it was too broad and must recognise that AI development will be open to using data research for any AI purposes and talked about his amendment on protecting children’s data, which is very important and worthy of consideration. This was supported by my noble friend Lady Kidron, who pointed out that the definition of scientific research could cover everything and warned that Clause 67 is not just housekeeping. She quoted the EDPS and talked about its critical clarification not being included in the transfer of the scientific definition into the Bill. The noble Lord, Lord Holmes, asked what in the Bill has changed when you consider how much has changed in AI. I was very pleased to have the support of the noble Viscount, Lord Camrose, who warned against the abuse and misuse of data and the broad definition in this Bill, which could muddy the waters. He supported the public interest test, which would be fertile ground for helping define scientific data.

Surely this Bill should walk the line in encouraging the AI rollout to boost research and development in our science sector. I ask the Minister to meet me and other concerned noble Lords to tighten up Clauses 67 and 68. On that basis, I beg leave to withdraw my amendment.

Amendment 59 withdrawn.

--- Later in debate ---

Viscount Camrose (Con)

- Hansard - - - Excerpts

My Lords, Amendments 66, 67 and 80 in this group are all tabled in my name. Amendment 66 requires scientific research carried out for commercial purposes to

“be subject to the approval of an independent ethics committee”.

Commercial research is, perhaps counterintuitively, generally subjected to fewer ethical safeguards than research carried out purely for scientific endeavour by educational institutions. Given the current broad definition of scientific research in the Bill—I am sorry to repeat this—which includes research for commercial purposes, and the lower bar for obtaining consent for data reuse should the research be considered scientific, I think it would be fair to require more substantial ethical safeguards on such activities.

We do not want to create a scenario where unscrupulous tech developers use the Bill to harvest significant quantities of personal data under the guise of scientific endeavour to develop their products, without having to obtain consent from data subjects or even without them knowing. An independent ethics committee would be an excellent way to monitor scientific research that would be part of commercial activities, without capping data access for scientific research, which aims more purely to expand the horizon of our knowledge and benefit society. Let us be clear: commercial research makes a huge and critically important contribution to scientific research, but it is also surely fair to subject it to the same safeguards and scrutiny required of non-commercial scientific research.

Amendment 67 would ensure that data controllers cannot gain consent for research purposes that cannot be defined at the time of data collection. As the Bill stands, consent will be considered obtained for the purposes of scientific research if, at the time consent is sought, it is not possible to identify fully the purposes for which the personal data is to be processed. I fully understand that there needs to be some scope to take advantage of research opportunities that are not always foreseeable at the start of studies, particularly multi-year longitudinal studies, but which emerge as such studies continue. I am concerned, however, that the current provisions are a little too broad. In other words: is consent not actually being given at the start of the process for, effectively, any future purpose?

Amendment 80 would prevent the data reuse test being automatically passed if the reuse is for scientific purposes. Again, I have tabled this amendment due to my concerns that research which is part of commercial activities could be artificially classed as scientific, and that other clauses in the Bill would therefore allow too broad a scope for data harvesting. I beg to move.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, it seems very strange indeed that Amendment 66 is in a different group from group 1, which we have already discussed. Of course, I support Amendment 66 from the noble Viscount, Lord Camrose, but in response to my suggestion for a similar ethical threshold, the Minister said she was concerned that scientific research would find this to be too bureaucratic a hurdle. She and many of us here sat through debates on the Online Safety Bill, now an Act. I was also on the Communications Committee when it looked at digital regulations and came forward with one of the original reports on this. The dynamic and impetus which drove us to worry about this was the lack of ethics within the tech companies and social media. Why on earth would we want to unleash some of the most powerful companies in the world on reusing people’s data for scientific purposes if we were not going to have an ethical threshold involved in such an Act? It is important that we consider that extremely seriously.

Lord Clement-Jones (LD)

- Hansard - - - Excerpts

My Lords, I welcome the noble Viscount to the sceptics’ club because he has clearly had a damascene conversion. It may be that this goes too far. I am slightly concerned, like him, about the bureaucracy involved in this, which slightly gives the game away. It could be seen as a way of legitimising commercial research, whereas we want to make it absolutely certain that that research is for the public benefit, rather than imposing an ethical board on every single aspect of research which has any commercial content.

We keep coming back to this, but we seem to be degrouping all over the place. Even the Government Whips Office seems to have given up trying to give titles for each of the groups; they are just called “degrouped” nowadays, which I think is a sign of deep depression in that office. It does not tell us anything about what the different groups contain, for some reason. Anyway, it is good to see the noble Viscount, Lord Camrose, kicking the tyres on the definition of the research aspect.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Committee stage
Monday 16th December 2024

(1 year, 7 months ago)

Grand Committee

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Read Debate Ministerial Extracts Amendment Paper: HL Bill 40-III(a) Amendments for Grand Committee (Supplementary to the Third Marshalled List) - (13 Dec 2024)

Moved by

Viscount Colville of Culross

- Hansard - -

92: Clause 77, page 91, line 5, leave out “the number of data subjects,”
Member’s explanatory statement
This amendment reduces the likelihood of misuse of Clause 77 by AI model developers, who may otherwise seek to claim they do not need to notify data subjects of reuse for scientific purposes under Clause 77 because of the way that personal data is typically collected and processed for AI development, for example by scraping large amounts of personal data from the internet.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, I have tabled Amendments 92, 93, 101 and 105, and I thank the noble Lord, Lord Clement-Jones for adding his name to them. I also support Amendment 137 in the name of my noble friend Lady Kidron.

Clause 77 grants an exemption to the Article 13 and 14 rights of data subjects to be told within a set timeframe that their data will be reused for scientific research, if it would be impossible or involve disproportionate effort to do so. These amendments complement those I proposed to Clause 67. They aim to ensure that “scientific research” is limited in its definition and that the large language AI developers cannot say that they are doing scientific research and that the GDPR requirements involve too much effort to have to contact data subjects to reuse their data.

It costs AI developers time and money to identify data subjects, so this exemption is obviously very valuable to them and they will use it if possible. They will claim that processing and notifying data subjects from such a huge collection of data is a disproportionate effort, as it is hard to extract the identity of data subjects from the original AI model.

Up to 5 million data subjects could be involved in reusing data to train a large language model. However, the ICO requires data controllers to inform subjects that their data could be reused even if it involves contacting 5 million data subjects. The criteria set out in proposed new subsection (6) in Clause 77 play straight into the hands of ruthless AI companies that want to take advantage of this exemption.

Amendments 92 and 101 would ensure that the disproportionate effort excuse is not used if the number of data subjects is mentioned as a reason for deploying the excuse. Amendments 93 and 105 would clarify the practices and facts that would not qualify for the disproportionate effort exemption—namely,

“the fact the personal data was not collected from the data subject, or any processing undertaken by the controller that makes the effort involved greater”.

Without this wording, the Bill will mean that the data controller, when wanting to reuse data for training another large language model, could process the personal data on the original model and then reuse it without asking permission from the original subjects. The AI developer could say, “I don’t have the original details of the data subject, as they were deleted when the original model was trained. There was no identification of the original data subjects; only the data weight”. I fear that many companies will use this excuse to get around GDPR notification expectations.

Noble Lords should recognise that these provisions affect only AI developers seeking to reuse data under the scientific research provisions. These will mainly be the very large AI developers, which tend to use scrape data to train their general purpose models. Controllers will still be able to use personal data to train AI systems when they have lawful grounds to do so—they either have the consent of the data subject or there is a legitimate interest—but I want to make it clear that these provisions will not inhibit the legitimate training of AI models.

These amendments would ensure that organisations, especially large language AI developers, are not able to reuse data at scale, in contradiction to the expectations and intentions of data subjects. Failure to get this right will risk setting off a public backlash against the use of personal data for AI use, which would impede this Government’s aims of making this country an AI superpower. I beg to move.

--- Later in debate ---

Baroness Jones of Whitchurch (Lab)

- Hansard - - - Excerpts

Discussions with the ICO are taking place at the moment about the scope and intention of a number of issues around AI, and this issue would be included in that. However, I cannot say at the moment that that intention is specifically spelled out in the way that the noble Baroness is asking.

Viscount Colville of Culross (CB)

- Hansard - -

This has been a wide-ranging debate, with important contributions from across the Committee. I take some comfort from the Minister’s declaration that the exemptions will not be used for web crawling, but I want to make sure that they are not used at the expense of the privacy and control of personal data belonging to the people of Britain.

That seems particularly so for Amendment 137 in the name of the noble Baroness, Lady Kidron. I was particularly taken by her pointing out that children’s data privacy had not been taken into account when it came to AI, reinforced by the noble Baroness, Lady Harding, telling us about the importance of the Bill. She said it was paramount to protect children in the digital age and reminded us that this is the biggest breakthrough of our lifetime and that children need protecting from it. I hope very much that there will be some successful meetings, and maybe a government amendment on Report, responding to these passionate and heartfelt demands. On that basis, I sincerely hope the Minister will meet us all and other noble Lords to discuss these matters of data privacy further. On that basis, I beg leave to withdraw my amendment.

Amendment 92 withdrawn.

--- Later in debate ---

The amendment could lead to the introduction of measures to ensure private sector assessment and monitoring of impacts on work, people and fundamental rights, which would conform to the framework convention. If this does not do what the Government intend as regards adoption in the UK of that framework convention, I very much hope that the Government can give us more information about that at this time. I beg to move.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, Amendment 119 is in my name, and I thank the noble Lord, Lord Knight, for adding his name to it. I am pleased to add my name to Amendment 115A in the name of noble Viscount, Lord Camrose.

Transparency is key to ensuring that the rollout of ADM brings the public and, most importantly, public trust with it. I give the Committee an example of how a lack of transparency can erode that trust. The DWP is using a machine learning model to analyse all applications for a loan, as an advance on a benefit to pay bills and other costs, while a recipient waits for their first universal credit payment. The DWP’s own analysis of the model concluded that for all of the protected characteristics that were analysed, including age, marital status and disability, it found disparities in who was most likely to be incorrectly referred by the model.

It is difficult to assess whether the model is discriminatory, effective or even lawful. When the DWP rolled it out, it was unable to reassure the Comptroller and Auditor-General that its anti-fraud models treated all customer groups fairly. The rollout continues despite these concerns. The DWP maintains that the analysis does not present

“any immediate concerns of discrimination, unfair treatment or detrimental impact on customers”.

However, because so little information is available about the model, this claim cannot be independently verified to provide the public with confidence. Civil rights organisations, including the Public Law Project, are currently working on a potential claim against the DWP, including in relation to this model, on the basis that they may consider it may be unlawful.

The Government’s commitment to rolling out ADM has been accompanied by a statement in the other place in November by AI Minister Feryal Clark that the mandatory requirement for the use of the ATRS has been seen as a significant acceleration towards adopting the standard. In response to a Written Question, the Secretary of State confirmed that, as part of the rollout of ADM phase 1 to the 16 largest ministerial departments plus HMRC, there is a deadline for them to publish their first ATRS records by the end of July 2024. Despite the Government’s statement, only eight ATRS reports have been published on the hub. The Public Law Project’s TAG project has discovered at least 74 areas in which ADM is being used, and they are only the ones that it has been able to uncover by freedom of information requests and from tip-offs by affected people. There is clearly a shortfall in the implementation and rolling out of the use of the ATRS across government departments.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Committee stage
Wednesday 18th December 2024

(1 year, 7 months ago)

Grand Committee

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Read Debate Ministerial Extracts Amendment Paper: HL Bill 40-IV Fourth marshalled list for Grand Committee - (17 Dec 2024)

Finally, at the recent All-Party Parliamentary Group for Writers reception, we heard a moving speech by the author Joanne Harris, who made perhaps the most important point. She said that to a lot of the public, as soon as you utter the words “artificial intelligence”, people still think it is science fiction. It is not science fiction. As Joanne Harris and others have pointed out, it is happening now and happening in a big way. The Government need to deal with these concerns both urgently and effectively.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords I have been very impressed by the speeches of my noble friends Lady Kidron and Lord Freyberg, so I will be very brief. I declare in interest as a television producer who produces content. I hope that it has not been scraped up by AI machines, but who knows? I support the amendments in this group.

I know that AI is going to solve many problems in our economy and our society. However, in their chase for the holy grail of promoting AI, I join other noble Lords in asking the Government not to push our creative economy under the bus. It is largely made up of SMEs and single content producers, who do not have the money to pursue powerful AI companies to get paid for the use of their content in training their AI models. It is up to noble Lords to help shape regulations that protect our data and copyright laws and can be fully deployed in the defence of the creative economy.

I too have read the Government’s Copyright and Artificial Intelligence consultation paper, published yesterday. The foreword says:

“The proposals include a mechanism for rights holders to reserve their rights”,

which I, like my noble friend Lady Kidron and others, interpret as meaning that creators’ works can be used by AI developers unless they opt out and require licensing for the use of their work. The Government are following the EU example and going for the opt-out model. I think that the European Union is beginning to realise that it is very difficult to make that work, and it brings an unfairness to content producers. Surely, the presumption should be that AI web crawlers should get agreement before using content. The real problem is that content producers do not even know when their content has been used. Even the AI companies sometimes do not know what content has been used. Surely, the opt-out measure is like having your house raided and then asking the burglar what he has taken.

I call on the Minister to work with us to create an opt-in regime. Creators’ works should be used only when already licensed by the AI companies. The companies say they usually do not use content, only data points. Surely that is like saying to a photographer, “We’ve used 99% of the pixels in a picture but not the whole picture”. If even one pixel is used, the photographer needs to know and be compensated.

The small companies and single content producers of our country are the backbone of our economy, as other noble Lords have said. They are threatened by this technology, in which we have placed so much faith. I ask the Minister to respond favourably to Amendments 204, 205 and 206 to ensure that we have fairness between some of the biggest AI players in the world and the hard-pressed people who create content.

Lord Hampton (CB)

- Hansard - - - Excerpts

My Lords, I support Amendments 204, 205 and 206 in the names of my noble friends Lady Kidron and Lord Freyberg, and of the noble Lords, Lord Stevenson and Lord Clement-Jones, in what rapidly seems to be becoming the Cross-Bench creative club.

I spent 25 years as a professional photographer in London from the late 1980s. When I started, retouchers would retouch negatives and slides by hand, charging £500 an hour. Photoshop stopped that. Professional film labs such as Joe’s Basement and Metro would work 24 hours a day. Snappy Snaps and similar catered for the amateur market. Digital cameras stopped that. Many companies provided art prints, laminating and sundry items for professional portfolios. PDFs and websites stopped that. Many different forms of photography, particularly travel photography, were taken away when picture libraries cornered the market and drove down commissions to unsustainable levels. There were hundreds if not thousands of professional photographers in the country. The smartphone has virtually stopped that.

All these changes were evolution and the result of a world becoming more digitised, but AI web crawlers are different, illegally scraping images without consent or payment then potentially killing the trade of the victim by setting up in competition. This is a parasite, but not in the true sense, because a parasite is careful to keep its victims alive.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Science, Innovation & Technology

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Science, Innovation & Technology

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Report stage
Tuesday 21st January 2025

(1 year, 6 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts Amendment Paper: HL Bill 57-I Marshalled list for Report - (17 Jan 2025)

Moved by

Viscount Colville of Culross

- View Speech - Hansard - -

14: Clause 67, page 75, line 10, after “scientific” insert “and that is conducted in the public interest”
Member’s explanatory statement
This amendment ensures that to qualify for the scientific research exception for data reuse, that research must be in the public interest. This requirement already exists for medical research, but this amendment would apply it to all scientific research wishing to take advantage of the exception.

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, I thank my noble friend Lady Kidron and the noble Viscount, Lord Camrose, for adding their signatures to my Amendment 14. I withdrew this amendment in Committee, but I am now asking the Minister to consider once again the definition of “scientific research” in the Bill. If he cannot satisfy me in his speech this evening, I will seek the opinion of the House.

I have been worried about the safeguards for defining scientific research since the Bill was published. This amendment will require that the research should be in “the public interest”, which I am sure most noble Lords will agree is a laudable aim and an important safeguard. This amendment has been looked at in the context of the Government’s recent announcements on turning this country into an AI superpower. I am very much a supporter of this endeavour, but across the country there are many people who are worried about the need to set up safeguards for their data. They fear data safety is threatened by this explosion of AI and its inexorable development by the big tech companies. This amendment will go some way to building public trust in the AI revolution.

The vision of Donald Trump surrounded at his inauguration yesterday by tech billionaires, most of whom have until recently been Democrats, puts the fear of God into me. I fear their companies are coming for our data. We have some of the best data in the world, and it needs to be safeguarded. The AI companies are spending billions of dollars developing their foundation models, and they are beholden to their shareholders to minimise the cost of developing these models.

Clause 67 gives a huge fillip to the scientific research community. It exempts research which falls within the definition of scientific research as laid out in the Bill from having to gain new consent from data subjects to reuse millions of points of data.

It costs time and money for the tech companies to get renewed consent from data holders before reusing their data. This is an issue we will discuss further when we debate amendments on scraping data from creatives without copyright licensing. It is clear from our debates in Committee that many noble Lords fear that AI companies will do what they can to avoid either getting consent or licensing data for use in scraping data. Defining their research as scientific will allow them to escape these constraints. I could not be a greater supporter of the wonderful scientific research that is carried out in this country, but I want the Bill to ensure that it really is scientific research and not AI development camouflaged as scientific research.

The line between product development and scientific research is often blurred. Many developers posit efforts to increase model capabilities, efficiency, or indeed the study of their risks, as scientific research. The balance has to be struck between allowing this country to become an AI superpower and exploiting its data subjects. I contend that this amendment will go far to allay public fears of the abuse and use of their data to further the profits and goals of huge AI companies, most of which are based in the United States.

Noble Lords have only to look at the outrage last year at Meta’s use of Instagram users’ data without their consent to train the datasets for its new Llama AI model to understand the levels of concern. There were complaints to regulators, and the ICO posted that Meta

“responded to our request to pause and review plans to use Facebook and Instagram user data to train generative AI”.

However, so far, there has been no official change to Meta’s privacy policy that would legally bind it to stop processing data without consent for the development of its AI technologies, and the ICO has not issued a binding order to stop Meta’s plans to scrape users’ data to train its AI systems. Meanwhile, Meta has resumed reusing subjects’ data without their consent.

I thank the Minister for meeting me and talking through Amendment 14. I understand his concerns that, at a public interest threshold, the definition of scientific research will create a heavy burden on researchers, but I think it is worth the risk in the name of safety. Some noble Lords are concerned about the difficulty of defining “public interest”. However, the ICO has very clear guidelines about what public interest consists of. It states that

“you should broadly interpret public interest in the research context to include any clear and positive public benefit likely to arise from that research”.

It continues:

“The public interest covers a wide range of values and principles about the public good, or what is in society’s best interests. In making the case that your research is in the public interest, it is not enough to point to your own private interests”.

The guidance even includes further examples of research in the public interest, such as

“the advancement of academic knowledge in a given field … the preservation of art, culture and knowledge for the enrichment of society … or … the provision of more efficient or more effective products and services for the public”.

This guidance is already being applied in the Bill to sensitive data and public health data. I contend that if these carefully thought-through guidelines are good enough for health data, they should be good enough for all scientific data.

This view is supported in the EU, where

“the special data protection regime for scientific research is understood to apply where … the research is carried out with the aim of growing society’s collective knowledge and wellbeing, as opposed to serving primarily one or several private interests.”

The Minister will tell the House that the data exempted to be used for scientific research is well protected—that it has both the lawfulness test, as set out in the UK GDPR, and a reasonableness test. I am concerned that the reasonableness test in this Bill references

“processing for the purposes of any research that can reasonably be described as scientific, whether publicly or privately funded and whether carried out as a commercial or non-commercial activity”.

Normally, a reasonableness test requires an expert in the context of that research to decide whether it is reasonable to consider it scientific. However, in this Bill, “reasonable” just means that an ordinary person in the street can decide whether the research is reasonable to be considered scientific. This must be a broadening of the threshold of the definition.

It seems “reasonable” in the current climate to ask the Government to include a public interest test before giving the AI companies extensive scope to reuse our data, without getting renewed consent, on the pretext that the work is for scientific research. In the light of possible deregulation of the sector by the new regime in America, it is beholden on this country to ensure that our scientific research is dynamic, but safe. If the Government can bring this reassurance then for millions of people in this country they will increase trust in Britain’s AI revolution. I beg to move.

Baroness Kidron (CB)

- View Speech - Hansard - - - Excerpts

My Lords, I support my noble friend Lord Colville. He has made an excellent argument, and I ask noble Lords on the Government Benches to think about it very carefully. If it is good enough for health data, it is good enough for the rest of science. In the interest of time, I will give an example of one of the issues, rather than repeat the excellent argument made by my noble friend.

In Committee, I asked the Government three times whether the cover of scientific research could be used, for example, to market-test ways to hack human responses to dopamine in order to keep children online. In the Minister’s letter, written during Committee, she could not say that the A/B testing of millions of children to make services more sticky—that is, more addictive—would not be considered scientific, but rather that the regulator, the ICO, could decide on a case-by-case basis. That is not good enough.

There is no greater argument for my noble friend Lord Colville’s amendment than the fact that the Government are unable to say if hacking children’s attention for commercial gain is scientific or not. We will come to children and child protection in the Bill in the next group, but it is alarming that the Government feel able to put in writing that this is an open question. That is not what Labour believed in opposition, and it is beyond disappointing that, now in government, Labour has forgotten what it then believed. I will be following my noble friend through the Lobby.

--- Later in debate ---

I hope the noble Viscount is content to withdraw this amendment, given these reassurances and the concerns about a significant unintended consequence from going down this route.

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

My Lords, I am grateful and impressed that the Minister has stepped into this controversial sphere of data management at such short notice. I wish his colleague, the noble Baroness, Lady Jones, a swift recovery.

I hope that noble Lords listened to the persuasive speeches that were given across the Benches, particularly from my noble friend Lady Kidron, with her warning about blurring the definition of scientific research. I am also grateful to the Opposition Benches for their support. I am glad that the noble Lord, Lord Markham, thinks that I am threading the needle between research and public trust.

I listened very carefully to the Minister’s response and understand that he is concerned by the heavy burden that this amendment would put on scientific research. I have listened to his explanation of the OECD Frascati principles, which define scientific research. I understand his concern that the rigorous task of demanding that new researchers have to pass a public interest test will stop many from going ahead with research. However, I repeat what I said in my opening speech: there has to be a balance between generating an AI revolution in this country and bringing the trust of the British people along with it. The public interest test is already available for restricted research in this field; I am simply asking for it to be extended to all scientific research.

I am glad that the reasonableness and lawfulness tests are built into Clause 67, but I ask for a test that I am sure most people would support—that the research should have a positive public benefit. On that note, I would like to seek the opinion of the House.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Science, Innovation & Technology

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Science, Innovation & Technology

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Report stage
Tuesday 21st January 2025

(1 year, 6 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts Amendment Paper: HL Bill 57-I Marshalled list for Report - (17 Jan 2025)

Baroness Freeman of Steventon (CB)

- View Speech - Hansard - - - Excerpts

My Lords, I support Amendment 34 from the noble Lord, Lord Clement-Jones, and will speak to my own Amendment 35, which amends it. When an algorithm is being used to make important decisions about our lives, it is vital that everyone is aware of what it is doing and what data it is based on. On Amendment 34, I know from having had responsibility for algorithmic decision support tools that users are very interested in how recent the data it is based on is, and how relevant it is to them. Was the algorithm derived from a population that included people who share their characteristics? Subsection (1)(c)(ii) of the new clause proposed in Amendment 34 refers to regular assessment of the data used by the system. I would hope that this would be part of the meaningful explanation to individuals to be prescribed by the Secretary of State in subsection (1)(b).

Amendment 35 would add to this that it is vital that all users and procurers of such a system understand its real-world efficacy. I use the word “efficacy” rather than “accuracy” because it might be difficult to define accuracy with regard to some of these systems. The procurer of any ADM system should want to know how accurate it is using realistic testing, and users should also be aware of those findings. Does the system give the same outcome as a human assessor 95% or 60% of the time? Is that the same for all kinds of queries, or is it more accurate for some groups of people than others? The efficacy is really one of the most important aspects and should be public. I have added an extra line that ensures that this declaration of efficacy would be kept updated. One would hope that the performance of any such system would be monitored anyway, but this ensures that the outcomes of such monitoring are in the public domain.

In Committee, the Minister advised us to wait for publication of the algorithmic transparency records that were released in December. Looking at them, I think they make clear the much greater need for guidance and stringency in what should be mandated. I will give two short examples from those records. For the DBT: Find Exporters algorithm, under “Model performance” it merely says that it uses Brier scoring and other methods, without giving any actual results of that testing to indicate how well it performs. It suggests looking at the GitHub pages. I followed that link, and it did not allow me in. The public have no access to those pages. This is why these performance declarations need to be mandated and forced to be in the public domain.

In the second example, the Cambridgeshire trial of an externally supplied object detection system just cites the company’s test data, claiming average precision in a “testing environment” of 43.5%. This does not give the user a lot of information. Again, it links to GitHub pages produced by the supplier. Admittedly, this is a trial, so perhaps the Cambridgeshire Partnership will update it with its real-world trial data. But that is why we need to ensure annual updates of performance data and ensure that that data is not just a report of the supplier’s claims in a test environment.

The current model of algorithmic transparency records is demonstrably not fit for purpose, and these provisions would help put them on a much firmer footing. These systems, after all, are making life-changing decisions for all of us and we all need to be sure how well they are doing and put appropriate levels of trust in them accordingly.

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

My Lords, I have added my name to Amendment 36 tabled by the noble Lord, Lord Clement-Jones. I also support Amendments 26, 27, 28, 31, 32 and 35. The Government, in their AI Statement last week, said that ADM will be rolled out across the public sector in the coming months and years. It will increase productivity and provide better public services to the people of this country.

However, there are many people who are fearful of their details being taken by an advanced computer, and a decision which could affect their lives being made by that computer. Surely the days of “computer says no” must be over. People need to know that there is a possibility of a human being involved in the process, particularly when dealing with the public sector. I am afraid that my own interactions with public sector software in various government departments have not always been happy ones, and I have been grateful to be able to appeal to a human.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Science, Innovation & Technology

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Science, Innovation & Technology

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Report stage
Tuesday 28th January 2025

(1 year, 6 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts Amendment Paper: HL Bill 57-II Second marshalled list for Report - (24 Jan 2025)

Lord Stevenson of Balmacara (Lab)

- View Speech - Hansard - - - Excerpts

I am grateful to the noble, Lord Black, for daring to respond to the wonderful speech that opened the debate; I thought I might come in immediately afterwards, but I was terrified by it, so I decided that I would shelter on these Benches and gather my strength before I could begin to respond.

I feel that I have to speak because I am a member of the governing party, which is against these amendments. However, I have signed up to them because I have interests in the media—which I declare; I suppose I should also declare that I have a minor copyright, but that is very small compared with the ones we have already heard about—and because I feel very strongly that we will get ourselves into even more trouble unless action is taken quickly. I have a very clear view of the Government’s proposals, thanks to a meeting with my noble friend the Minister yesterday, where he went through, in detail, some of the issues and revealed some of the thinking behind them; I hope that he will come back to the points he made to me when he comes to respond.

There is no doubt that the use of a copyright work without the consent of the copyright owner in the United Kingdom is an infringement, unless it is “fair dealing” under UK copyright law. However, because of the developments in technology—the crawlers, scrapers and GAI that we have been hearing about—there is a new usage of a huge number of copyright works for the training of algorithms. That has raised questions about whether, and if so how, such usage has to be legislated for as “fair dealing”—if it is to be so—or in some other way, if there is indeed one.

It is right, therefore, for the Government to have required the IPO to carry out a consultation on copyright and AI, which we have been talking about. However, given the alarm and concern evident in the creative sector, we certainly regret the delay in bringing forward this consultation and we are very concerned about its limited scope. Looking at it from a long way away, it seems that this is as much a competition issue as it is a copyright issue. It seems to me and to many others, as we have heard, that the IPO, by including in the consultation document a proposed approach described as an “exception with rights reservation”, has made a very substantial mistake.

This may just be a straw-person device designed to generate more responses, but, if so, it was a bad misjudgement. Does it not make the whole consultation exercise completely wasteful and completely pointless to respond to? When my noble friend the Minister comes to respond, I hope that he, notwithstanding that proposed approach, will confirm that, as far as the Government are concerned, this is a genuine consultation and that all the possible options outlined by the IPO—and any other solutions brought forward during the consultation—will be properly considered on their merits and in the light of the responses to the consultation.

What the creative industries are telling us—they have been united and vehement about this issue, as has already been described, in a way that I have never seen before—is that they must have transparency about what material is being scraped, the right to opt in to the TDMs taking place and a proper licensing system with fair remuneration for the copyright material used. The question of whether the GAI developers should be allowed to use copyright content, with or without the permission of the copyright owner, is a nuanced one, as a decision either way will have very wide-ranging ramifications. However, as we have heard, this issue is already affecting the livelihood of our creative sector—the one that, also as we have heard, we desperately need if we are to support a sustainable creative economy and provide the unbiased information, quality education and British-based entertainment that we all value and want to see flourish.

We understand the need to ensure that the companies that want access to high-quality data and copyright material to train their AI models respect, and will be happy to abide by, any new copyright or competition regulations that may be required. However, the proposals we have heard about today—the ones that would come from the consultation, if we have to delay—will probably be very similar to the amendments before the House, which are modest and fair. We should surely not want to work with companies that will not abide by such simple requirements.

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

My Lords, I support Amendments 44A and the consequential amendments in this group in the name of my noble friend Lady Kidron, whose speech has, I think, moved the whole Committee across all Benches.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Consideration of Commons amendments and / or reasons
Monday 12th May 2025

(1 year, 2 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts Amendment Paper: HL Bill 100-I Marshalled list for for Consideration of Commons Amendments - (9 May 2025)

Baroness Hayter of Kentish Town (Lab)

- View Speech - Hansard - - - Excerpts

My Lords, I too will speak to Motion 32A. I thank my noble friend the Minister for his confirmation of the Government’s welcome of the Supreme Court ruling and his welcome of the Sullivan report. I also very much welcome the words that he has used today and thank him for the discussions that we have been able to have.

Can he confirm that where the Equality Act allows for a women-only space, any digital IT system used for that purpose would refer to biological sex as the relevant information? With regard to public authorities, I assume that organisations such as Sport England and the GMC are counted as public authorities because they are statutory. At the moment the GMC does not record the biological sex of doctors, only the gender. When that also goes digital, will it be confined to biological sex so that, again, patients can know the sex of their physician, assuming that it will be digital? I think that the Minister understands the questions I am posing and that his wording does give that reassurance, but any clarity would be welcome.

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

My Lords, I stand in support of my Motion 43A. I welcome so much of this Bill. I want this country to be a champion of technology and hope that it becomes a tech powerhouse, attracting hundreds of millions of pounds-worth of investment in the development of AI. I understand the concerns expressed by the Minister, but I am still pressing ahead with this amendment because I want the people of this country to have control of their data and how it is used.

This amendment is a push-back against the way the AI companies have been abusing the use of people’s data in training their AI models. Last year, Meta reused data from Instagram users without their consent to train up its Llama AI model. Once this was discovered, there was a huge outcry from the owners of the data and an appeal to the ICO. As a result, Meta stopped the processing and the ICO said,

“it is crucial that the public can trust that their privacy rights will be respected from the outset”.

I want to make sure that when the Bill becomes law, it reassures the people of this country that they can trust the new technology. The battle to stop the abuse of data is a central concern of my indomitable noble friend Lady Kidron, who is sitting beside me and whose amendment is in the next group. It responds to the theft of copyright belonging to millions of creatives, including authors and artists, by AI companies. As it stands, Clause 67 gives a powerful exemption, allowing AI companies to reuse data without consent if they can show that their work aligns with the definition of “scientific research” set out in the Bill. I fear that this definition is so widely drawn that it will allow AI models to reuse data without consent, claiming that they are carrying out scientific research when in fact they are using it for product development and their own profit.

I thank the Ada Lovelace Institute for its constant support throughout the lengthy progress of this Bill. I expressed my concern in Committee and on Report. Chi Onwurah, the very respected chair of the Science, Innovation and Technology Committee in the other place, tabled a similar amendment. However, despite meetings with Ministers, they have offered nothing to assuage our concerns, which has forced me to push this amendment at this stage.

Proposed new paragraph 2A inserted by this amendment would tighten the definition of what counts as scientific research. It is taken from the Frascati manual, developed by the OECD in order to compare R&D efforts made by different companies and identify what key features underpin them. The Government support the Frascati definition. In Committee, the Minister said the research test set out in the Bill “will not operate alone”, and will

“be in the context of the Frascati definition and the ICO’s guidance”.—[Official Report, 21/1/25; col. 1637.]

He said that the Frascati definitions are merely guidance and that codification would bring burdens on scientific researchers, but this is not a new requirement: it is simply a codification of an existing standard set up by the ICO.

The central feature of this part of the amendment is that scientific research should increase the stock of human knowledge. The Minister has told your Lordships that not all scientific research will be new knowledge, that scientific research is often refuted or confirms previous findings, and that some scientific research will fail. But if there is refutation or confirmation of an experiment, that is an extension of human knowledge. Even if research fails, the researcher will know that the experiment does not work, and that is new knowledge. The requirement for scientific research to increase the stock of knowledge is a sensible precaution to preserve our data from abuse, and it will weed out the tech companies piggybacking on the clause for their own profit.

The purpose of this amendment is not just to tighten the definition. It is also to make sure that researchers have to consider it when they start to deploy the exemption for the reuse of data. The Minister has said it will lead to undue burden on scientists and stop research going ahead, but this definition is already being used by the ICO. The problem for a person whose data is being abused is that at the moment, if they want to appeal against its use without consent, they have to go to the ICO, which then has to apply the Frascati definition.

The ICO’s latest statistics show that only 12% of data protection complaints are dealt with within 90 days, compared with the target of 80%. Surely that means it is too late for the appeal against reuse of data without consent. The data will already have been absorbed into the AI training model and, as we have been continually told, it is hard for AI researchers to identify data once it is included in part of the model.

Proposed new paragraph 2A inserted by this amendment would stop this happening. By our putting a definition in the Bill, the AI researchers would have to consider it before reusing the data for their model, therefore saving data subjects having to appeal to the ICO if they are concerned about abuse.

Proposed new paragraph 2B inserted by this amendment responds to the Government’s claim that the “reasonably described” test in this clause is a tightening of the definition of scientific research. Over 14 of our leading law companies have looked at the Government’s test as set out in the Bill and described it variously as loosening, expanding or broadening the definition. However, Clause 67 asks the question whether the research can be reasonably described as scientific. The ICO or the courts will have to consider whether it is irrational to call this scientific research, but it is very hard to prove irrationality; it is a high bar.

I hope noble Lords will agree that the use of the usual reasonableness test asks, “Would a reasonable person conducting scientific research perform this activity in this manner?”. This test evaluates actual conduct against an objective standard of what constitutes proper scientific research.

The amendment seeks to realise what is already a requirement: that such research be conducted in line with standards based on the UK Research and Innovation Code of Practice for Research. It would ensure transparency for the use of scientific research. I am sure that during the course of the debate we will hear from scientists who will say that this debate will stifle research and stop new researchers undertaking work. However, this requirement is minimal, and the information required is that which researchers should already have to hand.

What I ask your Lordships to bear in mind when voting is that this amendment would give transparency into how people’s data is being reused. The new tests laid out in my amendment would be a powerful weapon in the fight against the abuse of people’s data. I want the new technologies to be successful, but they will be successful only if they have the trust of the people of the country. If people think that the Government have caved in to tech companies and allowed them to pillage our data for their own financial gain rather than for the progress of human knowledge, most will be outraged. I ask the Minister to assuage these fears and ensure that the Bill provides data in the people’s interests. Meanwhile, I will ask the opinion of the House at the end of this debate.

--- Later in debate ---

Moved by

Viscount Colville of Culross

- Hansard - -

43A: At end insert “, and do propose Amendment 43B instead of the words so left out of the Bill—

43B: Clause 67, page 75, line 28, at end insert—
“2A. For the purposes of paragraph 2, “scientific research” means creative and systematic work undertaken in order to increase the stock of knowledge, including knowledge of humankind, culture and society, and to devise new applications of available knowledge.
2B. To meet the reasonableness test in paragraph 2, the activity being described as scientific research must be conducted according to appropriate ethical, legal and professional frameworks, obligations and standards.””

Viscount Colville of Culross (CB)

- Hansard - -

My Lords, I listened carefully to the speeches of the noble Lords, Lord Winston and Lord Tarassenko, but I am not convinced that my amendment would stop the research as they suggested. However, it would protect users’ data as the technological revolution unfolds. I beg leave to test the opinion of the House.

Return to start of debate - Return to top of page

Data (Use and Access) Bill [HL] Debate

Full Debate: Read Full Debate
Department: Department for Business and Trade

Viscount Colville of Culross

Main Page: Viscount Colville of Culross (Crossbench - Life peer)

Department Debates - View all Viscount Colville of Culross's debates with the Department for Business and Trade

Data (Use and Access) Bill [HL]

Viscount Colville of Culross Excerpts

Consideration of Commons amendments and / or reasons
Monday 19th May 2025

(1 year, 2 months ago)

Lords Chamber

Share Debate

Read Full debate Data (Use and Access) Act 2025 Read Hansard Text Watch Debate Read Debate Ministerial Extracts Amendment Paper: HL Bill 102-I Marshalled list for Consideration of Commons Reasons - (16 May 2025)

Baroness Fox of Buckley (Non-Afl)

- View Speech - Hansard - - - Excerpts

My Lords, despite the fact this is not being pushed to a vote—I respect that, and I concede that the Government have made some clarifications, and potentially concessions, along the way in this debate—I think that the issue is not yet resolved. I call on the Government to try to solve this problem now, rather than leave it open to more years of muddle, confusion and misinterpretation, and that can happen away from here. I have noticed that the Government are not averse to using the odd statutory instrument, to which I am usually opposed; in this instance, I urge them to use a statutory instrument to sort this out. I fear that, unless they do, it will undermine trust in the new system.

To clarify, we are looking to identify datasets that have muddled up sex and gender, such as data from HMPO and the DVLA, and those that have not, such as sex registered at birth. Because of that muddle, we cannot rely on those databases. Is that not the very point? We are trying at this point to provide clarity to DVS providers. By the way, this would not in any way result in outing individual transgender people when they are using the DVS system to prove their identity or other attributes, such as their age or whatever. We are trying to ensure that each database has some consistency. If a dataset allows some people to be recorded as the wrong sex, then the whole dataset is unreliable as a source of sex data.

It was very helpful that the Government clarified in the midst of this, for example, that an official document such as a passport, whatever is written on it, cannot be proof of a change of sex; it is simply a record of the way somebody wants to be identified and is no use as a reliable source of sex data. As I have said, there are other official documents such as the driving licence where that is not the case.

I would simply urge the Government, from their own point of view, so that we do not carry on having this muddle and confusion and so that this system becomes trusted, to make sure that they sort this out, even if they will not do so here and now.

Viscount Colville of Culross (CB)

- View Speech - Hansard - -

My Lords, I thank the Minister for her engagement and for defining what genuine scientific research is. I hope very much that the AI companies, when using this extraordinary exemption, will listen to the Government, and that the Government will ensure that the policy is enforced. The trust of the people of this country would be lost if they felt that their data was being reused by AI companies simply for product enrichment and profit, rather than for genuine scientific research. I thank the noble Viscount, Lord Camrose, and the noble Lord, Lord Clement-Jones, for their parties’ support.

Lord Clement-Jones (LD)

- View Speech - Hansard - - - Excerpts

My Lords, I too thank the Minister for her introduction to the three Motions in this group.

On these Benches, we welcome the Supreme Court’s judgment on the meaning of “sex” in the Equality Act 2010. However, as Ministers have stressed—and we agree—it is paramount that we work through the implications of this judgment carefully and sensitively. As we have previously discussed, the EHRC is currently updating its statutory guidance.

Ministers have previously given assurances that they are engaged in appropriate and balanced work on data standards and data accuracy, and we accept those assurances. They have given a further assurance today about how the digital verification services framework will operate. We rely on those ministerial assurances. In summary, we believe that the previously proposed amendments were premature in the light of the EHRC guidance and that they risk undermining existing data standards work. On that basis, we support the Minister in her Motions A and D.

Turning to Motion B, the noble Viscount, Lord Colville, will not press his Amendment 43B at this stage, as he intends to accept the assurances given by Ministers. We have consistently supported the noble Viscount’s efforts to ensure that scientific research benefiting from the Bill’s provisions for data reuse is conducted according to appropriate ethical, legal and professional frameworks. The Government have given significant assurances in this area. We understand that their position is that the Bill does not alter the existing legal definition or threshold for what constitutes scientific research under UK GDPR. The Bill does not grant any new or expanded permissions for the reuse of data for scientific research purposes, and, specifically, it does not provide blanket approval for using personal data for training AI models under the guise of scientific research. The use of personal data for scientific research remains subject to the comprehensive safeguards of UK GDPR, including the requirement for a lawful basis, the adherence to data protection principles and the application of the reasonableness test, which requires an objective assessment.

The collection of assurances given during several stages of the Bill provides reassurance against the risk that commercial activities, such as training AI models purely for private gain, could improperly benefit from exemptions intended for genuine scientific research serving the public good. I very much hope that the Minister can reaffirm these specific points and repeat those assurances.

Return to start of debate - Return to top of page

All 10 Viscount Colville of Culross contributions to the Data (Use and Access) Act 2025

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]

Data (Use and Access) Bill [HL]