The occurrence of N- and C-terminal expansin domains in CBM63 sequences from the CAZy database. Protein sequences are represented by NCBI accessions. Expansin domains were annotated with the hmmscan command from the HMMER software package. The hits were filtered by a minimal domain-based score of 35 or 20 (chosen after comparison with HMMER’s domain-based “independent” e-values), a minimal hit length of 60 amino acids, and a maximal ratio of bias over domain-based score of 10%.
Sequences were downloaded from the Carbohydrate-Active enZYmes Database (CAZy) on June 3, 2019.