Why are some diagnostics NA?
nearest_neighbor_discrimination and barcoding_gap require intraspecific data. They are reported as NA when no species has at least two samples, or when the alignment does not provide enough valid within-species pairwise distances to estimate the statistic. These metrics need at least one species represented by two or more samples, because MarkerSeek must compare distances among samples from the same species. For example, barcoding gap uses the maximum intraspecific distance, and nearest-neighbor discrimination checks whether a sample's closest sequence includes a conspecific sample. If every species has only one sample, those intraspecific comparisons do not exist.
Related within-species diagnostics, including intraspecific divergence and misclassification risk, follow the same rule. In candidate_marker_features.tsv, these cells remain NA; the web table and this FAQ provide the reason without adding extra columns to the data file.