Skip to content

Contig count explodes from hundreds to 4000 after LongStitch pipeline #95

@rubadevis

Description

@rubadevis

I'm running the complete LongStitch pipeline on a mammalian genome (~2.5Gb) using standard parameters. However, my final contig count jumped dramatically from a few hundred (pre-LongStitch) to 4000 contigs, despite maintaining excellent assembly metrics:

N50: 73Mb (chromosome-scale)

Issues:
Is this expected behavior for mammalian genomes with moderate repeats (~40-48% LINEs)?

How can I reduce contig count while preserving N50 73Mb?

Parameter tweaks for LongStitch to favor scaffolding over fragmentation:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions