Baroness O'Neill of Bengarve
Main Page: Baroness O'Neill of Bengarve (Crossbench - Life peer)Department Debates - View all Baroness O'Neill of Bengarve's debates with the Home Office
(13 years ago)
Lords ChamberMy Lords, this Bill has quite a grand Title but a somewhat miscellaneous feel. Although this is a Second Reading debate, I will concentrate on one part of the Bill, which has already been mentioned. I hope to ask the Minister a range of questions about Part 6, which makes provision for the publication of data sets held by public authorities. I am not against what is generally called “transparency”. In fact, it is a great deal weaker than most people suppose. Transparency is an antidote to secrecy; its defect is that it is no guarantee of communication of any sort. That form of quasi communication can sometimes create an illusory advance in the things we try to do.
As I understand it, the Bill covers data sets which are neither government data—that is, produced or commissioned by government or government-controlled entities—nor official statistics. Both government data and official statistics are, after all, already subject to requirements for openness. The intention of the Bill is to bring data sets held by other public authorities—the term is slightly curiously used here—under similar, though not quite the same, requirements for openness. The other public authorities, I take it, include not only local authorities but, above all, universities, hospitals and research institutions and, no doubt, the publicly owned companies to which the Minister referred. I am not quite sure what sort of companies those are because I note that hospitals, universities and research institutions are not companies and are not publicly owned companies.
It is a complex problem and I should declare a non-pecuniary interest at this point as a member of the Royal Society Working Group on Science as a Public Enterprise. This group is looking specifically at the problem of scientific data sets—which are not always published, even when there are publications—and will report within the next few months.
Let me now return to the Bill. Given that the period within which institutions have to respond to requests to make data available is very short—three weeks or fewer—it seems to me that one effect, or perhaps purpose, of this Bill is to require investigators to make research data available on an ongoing basis, if requested, and in an ongoing way. Can the Minister confirm that the Bill removes any discretion to wait until a data set has been completed or checked, let alone until the research project is completed or the work based on that research project is published? Does the Bill require disclosure to competing research groups if they simply request that the data be made available to them? Does it require disclosure before a decision about possible commercialisation has been reached—or, indeed, can be reached?
I know that some noble Lords hope that there is a Scots remedy to this. I yield to nobody in my admiration for the remedies that have been taken in Scotland for certain matters; but, unfortunately, it is a fantasy that waiting for publication is a real solution to the problem. I will come back to why. Before that, I would like to ask a few questions about the form in which data would have to be released. Clause 100 tells us that,
“the public authority must, so far as reasonably practicable, provide the information to the applicant in an electronic form which is capable of re-use”.
At one level, that is clearly very sensible. The provisions for the release of government data are, of course, even stronger. They must be released in a form that can be freely used, reused and redistributed to anyone—subject only, at most, to the requirement to attribute and share alike.
I do not think that the present Bill demands quite as much, in that it permits controllers of data sets that have to be released to charge a fee for reuse if they are the sole owner of a copyright in the relevant database. Am I right in reading the Bill as requiring these public authorities to release data they control to competitors, subject only to the right to charge a fee? Am I right that there is nothing to prevent those who have secured release of data from reusing them for commercial and other purposes, without needing any consent from those who compiled the data?
Do the Government envisage defining a stage in a research project before which the requirements to release data would not apply? Or does the Bill ensure that a research team at a UK university must share their data with all-comers, including scientific and commercial competitors? Do the Government envisage that overseas researchers will reciprocally open their databases to scientific and commercial competitors? If this is not the picture that the Government have in mind, it would be very helpful if the Minister could say something about the stage of a research project at which data would have to be made available upon request. I do not think we can solve this problem by referring to a point of publication: most big data sets have many publications associated with them, which occur at many stages. Which of them would be decisive for triggering susceptibility to be requested for release of the data set?
Finally, I have some boring questions about costs. A requirement to make data sets available in electronic form that permits reuse does not sound onerous if one thinks about little data sets. However, a great deal of scientific research today compiles massive data sets. We are not talking about megabytes or gigabytes of information, but about terabytes and petabytes onwards. Making data sets of this size available is not done by the click of a mouse: it is expensive, time consuming and skilled. If data sets are to be reusable by others, it is likely that those who provide them will have to make metadata available. Do the Government have a view of the costs of this provision of the Bill? Have the Government identified who will bear the costs? Who is responsible for data archiving, for data integrity and for data security? On which budgets do the costs fall? What provisions do the Government have in mind against data mining that peels away anonymisation from ostensibly anomynised personal data? Do the Government think that these provisions are likely to incentivise public authorities which have such large databases to enter into partnership with private sector organisations which will hold the data—thereby eluding the provisions of this legislation? Open communication matters for science and for citizens; but mandated disclosure of reusable data in response to all and any requests, regardless of costs and time constraints, may not be the best way forward.
How you would interpret those words is really a question of fact and degree. I will have a further look before we get to Committee to see whether I can write in greater detail on that. If I cannot, I am sure that it is something that we would want to discuss in greater detail in Committee and at later stages.
Finally, I come to freedom of information and data protection in Part 6. I will touch on this only very briefly because I understand the concerns expressed by my noble friend Lady Hamwee, the noble Lord, Lord Bew, and the noble Baronesses, Lady O’Neill and Lady O’Loan, about the publication of research, particularly early publication. I accept that there is a genuine concern coming from Universities UK.
I just want to clarify that. There is absolutely no concern about the publication of research. That is what researchers aim to do. The concern is about applying the publication criteria to databases which are of a size that precludes their being published in journals, monographs or any other way. These are causing concern for large numbers of research institutions which have such databases but are committed to open publication.
I am sorry if I misunderstood the noble Baroness. I have written down “pre-publication”. I will look carefully at what she had to say. Certainly, I hope that we can address that in due course. The noble Lord, Lord Bew, said that we should copy Scotland but I think that the noble Baroness, Lady O’Neill, was not so keen on that idea. Again, we need to address these matters in due course and examine them in a manner that I properly understand, particularly as I just seem to have misunderstood the noble Baroness on this occasion. The noble Baroness went on to ask what she described as some boring questions about costs. As they are allegedly boring questions—I am sure they are not—I will address them in a letter.
That is a rather rapid gallop through some of the comments that we have received today. I thank all noble Lords for their contributions. If we can agree on nothing else, we can agree that we will have an interesting and possibly somewhat lengthy Committee stage. As noble Lords will know from the Motions that I will move after this Bill has had its Second Reading, it will move down a novel line with some of the Committee stage taking place in the Chamber on the more contentious issues and some taking place in Grand Committee. I hope that that will have the agreement of the House and that once the Second Reading Motion is agreed, your Lordships will permit me to move the other Motions that stand in my name.