Skip to content

Croissant Builder does not have the DatasetBuilder name #11045

@stefanches7

Description

@stefanches7

/!\ PLEASE INCLUDE THE FULL STACKTRACE AND CODE SNIPPET

Short description
Description of the bug.

Environment information

  • Operating System: win 11 pro

  • Python version: 3.12.9

  • tensorflow-datasets/tfds-nightly version: 4.9.8

  • tensorflow/tf-nightly version: 2.19.0

  • Does the issue still exists with the last tfds-nightly package (pip install --upgrade tfds-nightly) ? yes

Reproduction instructions

import tensorflow_datasets as tfds
import mlcroissant as mlc


builder = tfds.dataset_builders.CroissantBuilder(
    jsonld="export-dara.jsonld"
 )
builder.download_and_prepare()

where export-dara.jsonld is link

If you share a colab, make sure to update the permissions to share it.

Link to logs

Image

Expected behavior
Info of the dataset gets created

Additional context
Facing this issue with multiple dataset croissants on the web, e.g. https://www.scidb.cn/en/detail?dataSetId=067dfc6234b140ef9e3ba7b91c27684d# too

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions