Transfering data from Andromeda during clean-out

Andromeda clean out

The week of 4-14-2025, I cleaned out my folder on Andromeda /data/putnamlab/estrand/. I got rid of intermediate files that I don’t need but need to transfer some files to OSF and some to Unity for further analyses.

HoloINT WGBS

Raw data = /data/putnamlab/KITT/hputnam/20211008_HoloInt_WGBS. This should be zipped and moved to Andromeda until the paper is published.

Andromeda to OSF

HoloInt_WGBS will go to this OSF page: https://osf.io/ghsv8/. OSF has a client that was downloaded to Andromeda.

interactive

cd HoloInt_WGBS 

module load osfclient

## set username and OSF information
osf init
## Provide a username for the config file [current username:  ]:
emma_strand@uri.edu

## Provide a project for the config file [current project:  ]:
ghsv8

## list projects
osf ls 

## prompted to input password for the OSF project -- success! connected the ploidy OSF 

Transfer test file

osf -p ghsv8 -u emma_strand@uri.edu upload multiqc_report.html methylseq_output/multiqc_report.html
## this worked! now transfering larger files.

Transfer important files. I’m only transferring the files that are key intermediate steps that take the longest which is the alignment and two subsequent steps. This paper will be in review soon and hopefully we won’t need to take these files back through a pipeline.

osf -p ghsv8 -u emma_strand@uri.edu upload WGBS_methylseq_HoloInt_genomev2_multiqc_report.html methylseq_output/WGBS_methylseq_HoloInt_genomev2_multiqc_report.html

osf -p ghsv8 -u emma_strand@uri.edu upload bismark_summary_report.html methylseq_output/bismark_summary_report.html

osf -p ghsv8 -u emma_strand@uri.edu upload bismark_summary_report.txt methylseq_output/bismark_summary_report.txt

The biggest files I want to transfer are the tab gene files. Everything else is intermediate and I can redo on Unity if absolutely needed based on a peer review.

cd /data/putnamlab/estrand/HoloInt_WGBS/merged_cov_genomev2/tab_files

# Upload using wildcard (requires existing methylseq_output/ folder on OSF)
for file in *_gene; do
  osf -p ghsv8 -u emma_strand@uri.edu upload "$file" "methylseq_output/$file"
done
### this worked for 5 files but then stopped.. maybe I'm running out of OSF space..
### these should be zipped before transferring

zip -r tab_files.zip ./*

## Now upload repo zipped instead of individual files
osf -p ghsv8 -u emma_strand@uri.edu upload tab_files.zip methylseq_output/tab_files.zip

Also transferring the .cov files

cd /data/putnamlab/estrand/HoloInt_WGBS/merged_cov_genomev2
ls *.cov
## each sample has 2 files: 1047.CpG_report.merged_CpG_evidence.cov and 1047_sorted.cov 
## ill save both 

zip -r cov_files.zip ./*.cov

osf -p ghsv8 -u emma_strand@uri.edu upload cov_files.zip methylseq_output/cov_files.zip
## this didn't work because OSF has a 5 GB storage limit
## this would be the best intermediate to save other than the tab files..
## raw data is already on NCBI 
mv cov_files.zip Strand_HoloInt_WGBS_cov_files.zip

zip -r script_files.zip scripts/*
osf -p ghsv8 -u emma_strand@uri.edu upload script_files.zip script_files.zip

Andromeda to Unity

Strand_HoloInt_WGBS_cov_files.zip and the raw data in a tar folder but permission denied. Need Hollie to do this

cd /data/putnamlab/KITT/hputnam/20211008_HoloInt_WGBS

tar -czvf Strand_HoloInt_WGBS_rawdata.tar.gz ./*gz 

BleachingPairs WGBS

Raw data =

Switching OSF project to bleaching pairs

cd /data/putnamlab/estrand/BleachingPairs_WGBS

### switch the OSF project
osf init
emma_strand@uri.edu
p9txk

cd /data/putnamlab/estrand/BleachingPairs_WGBS/scripts
zip -r script_files.zip scripts/*

osf -p p9txk -u emma_strand@uri.edu upload script_files.zip script_files.zip

tar -czvf Strand_BleachingPairs_WGBS_bed.tar.gz ./*5x_enrichment.bed
tar -czvf Strand_BleachingPairs_WGBS_tab.tar.gz ./*5x_sorted.tab

## transfer those files to OSF 
module load osfclient
osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_WGBS_bed.tar.gz Strand_BleachingPairs_WGBS_bed.tar.gz
osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_WGBS_tab.tar.gz Strand_BleachingPairs_WGBS_tab.tar.gz

Bleaching Pairs 16S Testing

cd /data/putnamlab/estrand/Test_V3V4_16S/sample_sets/V4_515F/raw_data

tar -czvf Strand_BleachingPairs_16STesting_V4_515F_rawdata.tar.gz ./*gz
osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_16STesting_V4_515F_rawdata.tar.gz Strand_BleachingPairs_16STesting_V4_515F_rawdata.tar.gz

cd /data/putnamlab/estrand/Test_V3V4_16S/sample_sets/V3V4_341F/raw_data

tar -czvf Strand_BleachingPairs_16STesting_V3V4_341F_rawdata.tar.gz ./*gz
osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_16STesting_V3V4_341F_rawdata.tar.gz Strand_BleachingPairs_16STesting_V3V4_341F_rawdata.tar.gz

cd /data/putnamlab/estrand/Test_V3V4_16S/sample_sets/V3V4_338F/raw_data
tar -czvf Strand_BleachingPairs_16STesting_V3V4_338F_rawdata.tar.gz ./*gz
osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_16STesting_V3V4_338F_rawdata.tar.gz Strand_BleachingPairs_16STesting_V3V4_338F_rawdata.tar.gz

Bleaching Pairs 16S Data

cd /data/putnamlab/estrand/BleachingPairs_16S/Mothur

tar -czvf Strand_BleachingPairs_16S_rawdata.tar.gz ./*gz

osf -p p9txk -u emma_strand@uri.edu upload Strand_BleachingPairs_16S_rawdata.tar.gz Strand_BleachingPairs_16S_rawdata.tar.gz

Bleaching Pairs RNASeq

This is already on NCBI and counts files are on github so no OSF need.

Point Judith MBDBS

My folder is all temporary working data but I want to re-do this analysis anyway. Just make sure raw transfer is done.

What we do need on Unity that I will ask Jill to transfer:

  • /data/putnamlab/shared/PointJudithData_Rebecca (16S data raw data + analysis)
  • /data/putnamlab/shared/Oyst_Nut_RNA (RNAseq processing)
  • /data/putnamlab/KITT/hputnam/20200119_Oyst_Nut/MBDBS (Methylation MBDBS raw data) - this is already on NCBI
  • /data/putnamlab/KITT/hputnam/20200119_Oyst_Nut/DB AND /data/putnamlab/KITT/hputnam/20200119_Oyst_Nut/NS (RNAseq raw data) - this is already on NCBI

These are already on Unity! Hooray. I will have Jill transfer the Rebecca working folder.

Kevin data

I made file lists of the folders he has raw data for. These will need to go to NCBI

scp emma_strand@ssh3:/data/putnamlab/estrand/*.txt .

Written on April 18, 2025