I Want a one line FASTA!

15 visualizaciones (últimos 30 días)
Bio_Ing_Sapienza
Bio_Ing_Sapienza el 20 de En. de 2019
Comentada: Bio_Ing_Sapienza el 20 de En. de 2019
Hi everyone. I want to write a FASTA file but my sequence once it has been written has to be on one unique line in my FASTA file and not splitted up as we find when we take our file from a databank. e.g.
I do not want this situation but I want all my lines aligned .
Thanx a lot!
  2 comentarios
Image Analyst
Image Analyst el 20 de En. de 2019
I don't even understand that syntax - how it could even run on the command line. What is lcl? What is the structure KQ001645? Why do you have an assignment in brackets [locus_tag=AK88_00001]? And why does it follow the OR test "lcl | KQ0......." with a space? etc.
Post your code where we can actually copy it and run it.
Stephen23
Stephen23 el 20 de En. de 2019
Bio_Ing_Sapienza's "Answer" moved here:
.....That is not a code...Is the original fasta file that taken from NCBI and splitted in several line.Above every thing there is the header of my sequence, and then there is my aminoacid sequence. I want that sequence that you see in the screenshot THAT IS NOT EVIDENTLY A SCRIPT, to be on a single line....I'm trying to use fastawrite and regexp with the regular expression
EXPR='.*';
regexp(sequence{1},EXPR,'match','dotexceptnewline');
%this return me the framgents without problems but in the moment that I use FASTAWRITE it returns me the sequence once
%once more splitted in the same way I started.

Iniciar sesión para comentar.

Respuesta aceptada

Arthur Goldsipe
Arthur Goldsipe el 20 de En. de 2019
The fastawrite has a hard-coded limit of 70 characters per line. This follows NCBI recommendations. If you want to incease the maximum line length, I can think of 2 options. Either you modify the file created by fastawrite or you edit variable maxcols in fastawrite to be larger than your longest sequences.
  1 comentario
Bio_Ing_Sapienza
Bio_Ing_Sapienza el 20 de En. de 2019
it works. just modiefied the maxcols. Thanks a lot!

Iniciar sesión para comentar.

Más respuestas (0)

Categorías

Más información sobre Genomics and Next Generation Sequencing en Help Center y File Exchange.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by