Big Data Danger

This post comes mainly courtesy (again) of the astute Steve Hawkins, who responded to my concerns in the Getting Airborne post about the possible dangers of a MEGA Biobank. Over to Steve:

On the ‘big data’ front, I think that all genuine physical measurements will be useful if used in the right way. The danger comes from any extraction/filtering that uses diagnosis as the reference field. If they do that – and I’m sure Crawley and Co would, because they think they can diagnose without biomarkers – the results would be garbage, as there would be many conditions given the wrong name but appearing together.

On the other hand, filtering on key concrete signs like PEM, POTS, bedbound, etc. would pull up useful groupings whatever the ostensible diagnosis.

In the wider scheme of things, there are now a number of entrepreneuring projects aiming to collect ALL big medical data, and link all medical databases together. I read a good piece on this recently by one bioinformatician who is setting up a giant server, but I don’t seem to have bookmarked it. Here is a conference on getting all genomic information into ‘the cloud’ for free searching and filtering: And one from The Lancet, on the astronomical amount of data that is about to flow from mobile phones whose apps have turned into our version of Star Trek’s ‘tricorder’. All this info will go into ‘the cloud’:

So we’re getting to the stage where all data is useful: so long as it is faithfully produced. Sadly, we know from PACE that data will have to be graded by association with researcher, and those who cannot be trusted will have their data discarded. There is nothing they can do about this: if their name is on their shoddy work, it will go nowhere, and all the data they collected will be wasted.

There lies the danger of MEGA: not that it will pollute the big data, but that any good data it contains will be at risk of being discarded by everyone but Crawley and her associates. That is why patients should NOT let their data be associated with MEGA while Crawley is involved.

I think Steve has nailed it there, and as it seems unlikely that Prof Crawley will be willing to part company with MEGA, I still believe that we should sign the OMEGA (Opposing MEGA) petition to demonstrate our strength of feeling against the proposal as it stands. The original pro-MEGA petition has now been closed – perhaps because they realised that more people were taking their signatures off than were putting them on – but the OMEGA team are still promoting their counter petition. Here is their latest blog. Scroll to the end for the link to their petition or just click here.

Advertisements

Getting Airborne

Steve Hawkins, who often comments here at the blog and quietly does a lot of useful activist stuff behind the scenes, left the following comment/proposal on the OMEGA petition site (and added it here in response to the previous post). I thought it was worthy of a wider audience so I’m reposting it here to kick off today’s blog:

‘It seems unfortunate that there has to be a petition of this kind against what, in the right hands, and with careful preparation of protocols in advance, would undoubtedly be a gathering of very useful data; and I feel uncomfortable that this will discourage some of the very able researchers and research teams who have been brought into the MEGA group but had no part in earlier ill advised research proposals; but it seems that something of this sort will have to be done, to ensure a complete new start, and clean break with the discredited ‘science’ of biopsychosocial egotists.

‘I apologise to the, well-meaning, I’m sure, Prof. Holgate, and those others who I fear have had to be reticent in criticising poor research, because of the binding conditions that were attached to membership of the Research Collaborative, under the direction of the partisan ‘Science Media Centre’, but the time really has come to return to both freedom of speech and information in this research field, after the gambit of crying ‘harassment’ after any honest questioning, has been so clearly shown up for what it was, in the courts.

‘I would advise that a new steering group be set up for a large and inclusive, data gathering and biomic sequencing and typing study with the major emphasis on the severely affected, who are the most likely to yield clear differences worthy of more intensive study. By all means collect data from a quota of less severely disabled/sick patients as well, but only to the number necessary to provide a control match for each of the seriously ill study subjects. A similar number of healthy controls will also be needed.

‘Thus the size and expense of the study should stem from the maximum number of seriously ill participants for statistical certainty… (plus controls). If that turns out to be a very big cost Continue reading “Getting Airborne”