O'Reilly logo
  • Gabriel Meltzer thinks this is interesting:

# warning: this is incorrect!
$ grep -v "^#" Mus_musculus.GRCm38.75_chr1.gtf | head -n 3 | \
   sed -E 's/.*transcript_id "([^"]+)".*/\1/'

1    pseudogene    gene    3054233    3054733    .    [...]
ENSMUST00000160944
ENSMUST00000160944

What happened? Some

From

Cover of Bioinformatics Data Skills

Note

what does -E and - e do, amd how exactly does this regex work?