bbsunchen posted on 2013-2-1 11:59:57

Counting the number of records in a FASTA file

Today I rather foolishly wrote my own Perl script to count the number of records in a FASTA file:

#!/usr/bin/perl -w
# Program name: detectDataNum.pl
# Author      : SunChen
# Contact     : bbsunchen@gmail.com
# Date        : 04/21/2011
# Last Update : 04/21/2011
# Reference   : Please cite our following papers when you are using this script.
# None
#
# Description : count the records (lines starting with ">") in a FASTA file.
#===========================================================================
use warnings;
use strict;
use Getopt::Long;

my %opts;
GetOptions(\%opts, "f:s");
my $usage = <<"USAGE";
Program: $0
INPUT:
-f blabla...
USAGE
die $usage unless $opts{f};

# Three-argument open with a lexical handle; DATA is a reserved handle name in Perl.
open my $fh, '<', $opts{f} or die "Can't open file: $!";
my $lines = 0;
while (<$fh>) {
    my $data = $_;
    chomp($data);                  # deal with \n
    $data =~ s/\r//g;              # deal with \r
    next unless ($data =~ /\S+/);  # deal with blank line here.
    if ($data =~ m/^>/) {
        $lines++;
    }
}
close $fh;
print "data num is $lines\n";

Actually, a single command is enough...

grep -c "^>" s1.fa

For several files, grep -c "^>" s1.fa s2.fa prints one count per file, for example 100 and 200 respectively.
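If a single total across several files is wanted rather than one count per file, something like the following might do. This is only a sketch: it swaps grep for awk so the result is one bare number, and the function name count_fasta_records is made up for illustration, not part of the original post.

```shell
#!/bin/sh
# Hypothetical helper (name invented for this example):
# print the total number of FASTA records in all files given as arguments.
# A record is assumed to be any line that starts with ">".
count_fasta_records() {
    # awk reads the files one after another; n+0 prints 0 when nothing matched.
    awk '/^>/ { n++ } END { print n + 0 }' "$@"
}

# Example call (file names taken from the post):
#   count_fasta_records s1.fa s2.fa
```

With the example counts of 100 and 200 mentioned above, this call would print 300. Header lines preceded by whitespace are not counted, which matches the behaviour of the "^>" pattern used with grep.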