Fig. 2

Overview of the CAULIFINDER Branch B workflow. The row of three arrows at the top represents the three main steps of the workflow. Grey boxes indicate successive sub-steps, with the main tools highlighted in red. The main output files are shown in blue boxes. The input datasets are shown in khaki boxes, with arrows indicating the sub-step in which each is used. The grey looping arrows in steps 2 and 3 indicate the number of iterations of sequence selection using protein alignment with MUSCLE, followed by trimAl with empirical parameters.