FAQ

My project has been approved, what next?

The phenotype and genotype data are stored at separate institutions, both groups will be in contact separately to arrange data transfer. Note that all data is reindexed on a custom ID for you, so manual processing has to be done before we transfer the data to you.

How will I get access to the data?

Once we have built your genetics data extract we will make it available to you either via a link you can visit to download the data, or we will set up an SFTP account for you to download the data, depending on the size of the data.

How do I decrypt the data you have sent me?

Using Linux you can decrypt and uncompress the data files with commands like:

gpg 2016_APPLICATION_NAME.tar.gz.gpg
tar xvf 2016_APPLICATION_NAME.tar.gz

What is the MD5SUM file?

Whenever large volumes of data are transferred over a network there is a chance they will be corrupted en route. This can happen even over the best networks and may not be obvious in the data itself (a file could be trunkated for instance). To make sure the files you recieve are the same as the files we generated we calculate an MD5SUM of each file, when you recieve the data you should calculate an MD5SUM and confirm they match.

I now realise that I should have asked for some extra data sets.

Not to worry! Get in contact with METADAC and request an amendment. As long as the extra data doesn't change the original proposal (e.g. asking for data that is not relevant to the science question) or doesn't ask for different types of data (e.g. asking for sequence data when the original proposal did not) then it should be straightforward to accommodate this. This is still slower than getting it right first time though, so don't rely on this approach!