202 Accepted when creating Dataset via Dataverse API

Mike Conway

Apr 8, 2026, 1:57:58 PM
to Dataverse Users Community
Hi all,

We are creating datasets via the Dataverse API for the CAFE Dataverse, which has several custom metadata blocks. We already have about 127 entries published, but as we go to publish a new group of datasets, we see different behavior.

In the first run, at the end of January, the publish step returned a DOI. When we ran our pipeline again recently, we got a 202 Accepted with no other information, and no dataset was published.

I've pulled the metadata blocks (e.g. /api/dataverses/CAFE/metadatablocks?returnDatasetFieldTypes=true) and compared the current set of fields to the JSON structures we send. The payload -should- be OK, and it is valid JSON, but we get no indication of anything wrong, just this 202 and no further information...

We are using the Dataverse Python API (pyDataverse). Here are some logs that show what we see on this call:
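
The comparison described above can be made mechanical. The following is a minimal sketch, not part of the original pipeline: it assumes the field names for each block have already been extracted from the metadatablocks response into a plain dict of sets (`known_fields`, a hypothetical structure), and the helper name `find_unknown_fields` is also illustrative. It only checks top-level `typeName` values, not the child fields of compound types.

```python
def find_unknown_fields(payload, known_fields):
    """Return {block_name: [typeName, ...]} for top-level fields that
    are not listed for that block in known_fields.

    payload      -- dict shaped like the create-dataset JSON
                    ({'datasetVersion': {'metadataBlocks': {...}}})
    known_fields -- hypothetical dict: block name -> set of field names
    """
    unknown = {}
    blocks = payload["datasetVersion"]["metadataBlocks"]
    for block_name, block in blocks.items():
        # A block missing from known_fields has no allowed names at all.
        allowed = known_fields.get(block_name, set())
        extra = [
            field["typeName"]
            for field in block.get("fields", [])
            if field["typeName"] not in allowed
        ]
        if extra:
            unknown[block_name] = extra
    return unknown
```

An empty result means every top-level field in the payload is known to the collection; it does not prove the values themselves are valid.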

        resp = self.api.create_dataset(dataverse, json.dumps(dataverse_dataset_as_dict), publish=False)

The 202 is not an error, so there are no logs/messages that give any indication on why things do not proceed.

LOGS:


[2026-04-01T13:24:25.991-0400] {accel_to_dataverse_dissemination.py:44} INFO - disseminate with descriptor <accelerator_core.workflow.accel_data_models.DisseminationDescriptor object at 0x1232190a0>

[2026-04-01T13:24:25.991-0400] {accel_workflow_task.py:40} INFO - payload resolve for payload <accelerator_core.workflow.accel_data_models.DisseminationPayload object at 0x1204b8c50>

[2026-04-01T13:24:25.992-0400] {dataverse_connector.py:212} INFO - create dataset: {'datasetVersion': {'license': {'name': 'CC0 1.0', 'uri': ''}, 'protocol': '', 'authority': '', 'identifier': '', 'metadataBlocks': {'citation': {'displayName': 'Citation Metadata', 'fields': [{'typeName': 'title', 'multiple': False, 'typeClass': 'primitive', 'value': 'Yale Center for Earth Observation (YCEO) Global Surface Urban Heat Island (GSUHI) Data, Pixel-Level Composites, Version 5'}, {'typeName': 'alternativeURL', 'multiple': False, 'typeClass': 'primitive', 'value': 'https://yceo.users.earthengine.app/view/uhimap'}, {'typeName': 'otherId', 'multiple': True, 'typeClass': 'compound', 'value': []}, {'typeName': 'author', 'multiple': True, 'typeClass': 'compound', 'value': [{'authorName': {'typeName': 'authorName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Other'}, 'authorAffiliation': {'typeName': 'authorAffiliation', 'multiple': False, 'typeClass': 'primitive', 'value': ''}}, {'authorName': {'typeName': 'authorName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Yale Center for Earth Observation (YCEO)'}, 'authorAffiliation': {'typeName': 'authorAffiliation', 'multiple': False, 'typeClass': 'primitive', 'value': ''}}, {'authorName': {'typeName': 'authorName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Academic Institution'}, 'authorAffiliation': {'typeName': 'authorAffiliation', 'multiple': False, 'typeClass': 'primitive', 'value': ''}}]}, {'typeName': 'datasetContact', 'multiple': True, 'typeClass': 'compound', 'value': [{'datasetContactName': {'typeName': 'datasetContactName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Other'}, 'datasetContactEmail': {'typeName': 'datasetContactEmail', 'multiple': False, 'typeClass': 'primitive', 'value': 'nor...@xxx.org'}}, {'datasetContactName': {'typeName': 'datasetContactName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Yale Center for Earth Observation (YCEO)'}, 
'datasetContactEmail': {'typeName': 'datasetContactEmail', 'multiple': False, 'typeClass': 'primitive', 'value': 'nor...@xxx.org'}}, {'datasetContactName': {'typeName': 'datasetContactName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Academic Institution'}, 'datasetContactEmail': {'typeName': 'datasetContactEmail', 'multiple': False, 'typeClass': 'primitive', 'value': 'nor...@xxx.org'}}]}, {'typeName': 'dsDescription', 'multiple': True, 'typeClass': 'compound', 'value': [{'dsDescriptionValue': {'typeName': 'dsDescriptionValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'The Global Surface Urban Heat Island (GSUHI) displays composite data for surface urban heat islands on a global interactive map based on the intensity of urban clusters. The data, observed from 2003 to 2020, are obtained from satellite-based land surface temperature observations (MODIS) at 300 m resolution, the Landscan urban extent database, and other land cover and elevation data sources.'}}]}, {'typeName': 'keyword', 'multiple': True, 'typeClass': 'compound', 'value': [{'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Temperature'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Surface-Level'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Intensity'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Heat-Island Intensity'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Urban Cluster'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Daytime'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Nighttime'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 
'value': 'Daily Prediction'}}, {'keywordValue': {'typeName': 'keywordValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Global'}}]}, {'typeName': 'topicClassification', 'multiple': True, 'typeClass': 'compound', 'value': [{'topicClassValue': {'typeName': 'topicClassValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Other'}}, {'topicClassValue': {'typeName': 'topicClassValue', 'multiple': False, 'typeClass': 'primitive', 'value': 'Urban Heat Islands'}}]}, {'typeName': 'subject', 'multiple': True, 'typeClass': 'controlledVocabulary', 'value': ['Medicine, Health and Life Sciences', 'Earth and Environmental Sciences']}, {'typeName': 'language', 'multiple': True, 'typeClass': 'controlledVocabulary', 'value': []}, {'typeName': 'publication', 'multiple': True, 'typeClass': 'compound', 'value': [{'publicationRelationType': {'typeName': 'publicationRelationType', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'Cites'}, 'publicationCitation': {'typeName': 'publicationCitation', 'multiple': False, 'typeClass': 'primitive', 'value': 'Chakraborty, T., & Lee, X. (2019). A simplified urban-extent algorithm to characterize surface urban heat islands on a global scale and examine vegetation control on their spatiotemporal variability. 
International Journal of Applied Earth Observation and Geoinformation, 74, 269-280.'}, 'publicationIDType': {'typeName': 'publicationIDType', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'url'}, 'publicationURL': {'typeName': 'publicationURL', 'multiple': False, 'typeClass': 'primitive', 'value': 'https://doi.org/10.1016/j.jag.2018.09.015'}}]}, {'typeName': 'notesText', 'multiple': False, 'typeClass': 'primitive', 'value': '<b>Intended Use</b><ul><li></li></ul><br/><b>Strengths</b><ul><li>Dataset supports analysis of variation in annual and seasonal average daytime and nighttime mean Urban Heat Island intensity across global urban locations and multiple years in a consistent way</li></ul><br/><b>Limitations</b><ul><li>Dataset derived from urban extent data for 2001 and 2002 that does not reflect patterns of urbanization after 2002</li><li>Data not available for areas outside urban clusters</li><li>Dataset derived from satellite-based observations which may vary in quality due to cloud cover, air quality, and other factors</li><li>Dataset derived from urban extent data for 2001 and 2002 that does not reflect patterns of urbanization after 2002</li><li>Data not available for areas outside urban clusters</li><li>Dataset derived from satellite-based observations which may vary in quality due to cloud cover, air quality, and other factors</li></ul><br/>'}, {'typeName': 'producer', 'multiple': True, 'typeClass': 'compound', 'value': [{'producerName': {'typeName': 'producerName', 'multiple': False, 'typeClass': 'primitive', 'value': 'Global Surface Urban Heat Island (GSUHI)'}, 'producerAffiliation': {'typeName': 'producerAffiliation', 'multiple': False, 'typeClass': 'primitive', 'value': ''}, 'producerAbbreviation': {'typeName': 'producerAbbreviation', 'multiple': False, 'typeClass': 'primitive', 'value': 'GSUHI'}, 'producerURL': {'typeName': 'producerURL', 'multiple': False, 'typeClass': 'primitive', 'value': ''}}]}, {'typeName': 'depositor', 
'multiple': False, 'typeClass': 'primitive', 'value': 'Mike Conway'}, {'typeName': 'timePeriodCovered', 'multiple': True, 'typeClass': 'compound', 'value': [{'timePeriodCoveredStart': {'typeName': 'timePeriodCoveredStart', 'multiple': False, 'typeClass': 'primitive', 'value': '2003-01-01'}, 'timePeriodCoveredEnd': {'typeName': 'timePeriodCoveredEnd', 'multiple': False, 'typeClass': 'primitive', 'value': '2020-12-31'}}]}, {'typeName': 'kindOfData', 'multiple': True, 'typeClass': 'primitive', 'value': ['Data Resource', 'GeoTIFF']}]}, 'geospatial': {'displayName': 'Geospatial Metadata', 'fields': [{'typeName': 'geographicCoverage', 'multiple': True, 'typeClass': 'compound', 'value': [{'country': {'typeName': 'country', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'United States'}}]}, {'typeName': 'geographicUnit', 'multiple': True, 'typeClass': 'primitive', 'value': []}]}, 'computationalworkflow': {'displayName': 'Computational Workflow Metadata', 'fields': []}, 'customCAFEDataLocation': {'displayName': 'Metadata About Geospatial Files', 'fields': [{'typeName': 'cafeIncludesGeospatialFile', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'Yes'}]}, 'customCAFEDataSources': {'displayName': 'Metadata About Data Sources', 'fields': [{'typeName': 'cafeDerivedFromExistingDataset', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'Yes'}, {'typeName': 'cafeSourceData', 'multiple': True, 'typeClass': 'compound', 'value': [{'cafeSourceDataTitle': {'typeName': 'cafeSourceDataTitle', 'multiple': False, 'typeClass': 'primitive', 'value': ''}, 'cafeSourceDataAuthor': {'typeName': 'cafeSourceDataAuthor', 'multiple': True, 'typeClass': 'primitive', 'value': []}, 'cafeSourceDataInstitution': {'typeName': 'cafeSourceDataInstitution', 'multiple': False, 'typeClass': 'primitive', 'value': 'Global Surface Urban Heat Island (GSUHI)'}, 'cafeSourceDataVersionNumber': {'typeName': 'cafeSourceDataVersionNumber', 'multiple': False, 
'typeClass': 'primitive', 'value': ''}, 'cafeSourceDataDOIOrURL': {'typeName': 'cafeSourceDataDOIOrURL', 'multiple': False, 'typeClass': 'primitive', 'value': 'https://yceo.users.earthengine.app/view/uhimap'}, 'cafeSourceDataLastModifiedDate': {'typeName': 'cafeSourceDataLastModifiedDate', 'multiple': False, 'typeClass': 'primitive', 'value': ''}, 'cafeSourceDataDateObtained': {'typeName': 'cafeSourceDataDateObtained', 'multiple': False, 'typeClass': 'primitive', 'value': ''}, 'cafeSourceDataType': {'typeName': 'cafeSourceDataType', 'multiple': False, 'typeClass': 'controlledVocabulary', 'value': 'Other (specify)'}, 'cafeSourceDataTypeOther': {'typeName': 'cafeSourceDataTypeOther', 'multiple': False, 'typeClass': 'primitive', 'value': ''}, 'cafeSourceDataDisclaimer': {'typeName': 'cafeSourceDataDisclaimer', 'multiple': False, 'typeClass': 'primitive', 'value': ''}}]}]}}}}

[2026-04-01T13:24:26.194-0400] {_client.py:1038} INFO - HTTP Request: POST https://dataverse.harvard.edu/dataverse/api/v1/dataverses/accelTest/datasets "HTTP/1.1 202 Accepted"

[2026-04-01T13:24:26.195-0400] {dataverse_connector.py:217} INFO - response: <Response [202 Accepted]>

[2026-04-01T13:24:26.196-0400] {dataverse_connector.py:218} INFO - success? True

[2026-04-01T13:24:26.196-0400] {dataverse_connector.py:224} INFO - formatting dissem result from: 

[2026-04-01T13:24:26.196-0400] {dataverse_connector.py:240} ERROR - ERROR - Could not create dataverse dataset: Expecting value: line 1 column 1 (char 0)

[2026-04-01T13:24:26.196-0400] {accel_to_dataverse_dissemination.py:94} ERROR - error disseminating payload: ERROR - Could not create dataverse dataset: Expecting value: line 1 column 1 (char 0)
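
The last two log lines show the connector treating the 202 as success and then failing with "Expecting value: line 1 column 1 (char 0)", which is the message `json.loads` raises on an empty string. A defensive sketch of the response handling follows. The function name and the returned dict shape are hypothetical, and the only success criterion it assumes is a JSON body with `status` equal to `OK` and a `persistentId`, as in the successful output quoted later in this thread.

```python
import json


def interpret_create_response(status_code, body_text):
    """Classify a create-dataset response without assuming the body is JSON.

    Returns {'ok': True, 'persistent_id': ...} on success, otherwise
    {'ok': False, 'reason': ...} with a message that can be logged.
    """
    if not body_text.strip():
        # e.g. a bare 202 Accepted with nothing to parse
        return {"ok": False, "reason": f"HTTP {status_code} with empty body"}
    try:
        body = json.loads(body_text)
    except json.JSONDecodeError as exc:
        return {"ok": False, "reason": f"HTTP {status_code} with non-JSON body: {exc}"}
    if not isinstance(body, dict):
        return {"ok": False, "reason": f"HTTP {status_code} with unexpected JSON body"}
    data = body.get("data")
    pid = data.get("persistentId") if isinstance(data, dict) else None
    if body.get("status") == "OK" and pid:
        return {"ok": True, "persistent_id": pid}
    # Surface the server's own message when one is present.
    message = body.get("message", "no persistentId in response")
    return {"ok": False, "reason": f"HTTP {status_code}: {message}"}
```

With a check like this, a 202 with an empty body is reported as such instead of surfacing as a JSON parsing error further down the pipeline.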

 

Philip Durbin

Apr 8, 2026, 4:33:52 PM
to dataverse...@googlegroups.com
Hi Mike!

Nice to see you here. I'm having trouble with the dict you shared. I get this error when I try to create a dataset from it:

{'status': 'ERROR', 'message': 'Validation Failed: Point of Contact E-mail nor...@xxx.org  is not a valid email address...

Can you please take a look at https://github.com/gdcc/dataverse-recipes/pull/32 ?

It's a pull request I just made to use pyDataverse, like you are, to create a dataset.

export API_TOKEN=[REDACTED]
export COLLECTION=pdurbin

% python create_dataset.py dataset-finch1.json
Using base URL from $BASE_URL.
Using API token from $API_TOKEN.
Using collection (dataverse) from $COLLECTION.
Printing version of Dataverse
{'status': 'OK', 'data': {'version': '6.10 bugfixes', 'build': None}}
Creating a dataset in collection pdurbin
Dataset with pid 'doi:10.70122/FK2/AVQWKM' created.
{'status': 'OK', 'data': {'id': 2685491, 'persistentId': 'doi:10.70122/FK2/AVQWKM'}}


The dataset-finch1.json file above is the minimal example from the API Guide. Can you please send me the JSON you're trying to use? I might also ask that you temporarily give me permission to create a dataset in the CAFE collection. Otherwise, the metadata blocks won't be set up.

Also, it might be easier to go back and forth on our chat platform, Zulip. Please feel free to create a topic under #troubleshooting or #python about this at https://dataverse.zulipchat.com (or you can DM me).

Thanks,

Phil


--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/dataverse-community/0e606b57-2013-4528-ab94-9af78eb1bb6cn%40googlegroups.com.

Mike Conway

Apr 8, 2026, 4:45:13 PM
to dataverse...@googlegroups.com
Hey Philip, I had just removed that address when sharing the log. You can put any valid email address in there.

Philip Durbin

Apr 8, 2026, 5:15:09 PM
to dataverse...@googlegroups.com
Ok, but I'm still blocked without your JSON.

Philip Durbin

Apr 8, 2026, 5:30:14 PM
to dataverse...@googlegroups.com
Actually, pyDataverse author Jan Range converted your dict to JSON for me. But can you give me write access to CAFE so I can try to reproduce the problem you're seeing?
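
For anyone else who needs to turn a logged Python dict like the one above into JSON, one standard-library approach is sketched here. How the conversion was actually done in this thread is not stated, and the helper name is illustrative. It assumes the logged text has been rejoined so that no line break falls inside a quoted string.

```python
import ast
import json


def dict_repr_to_json(logged_text):
    """Convert the text of a Python dict literal (single quotes,
    True/False/None) into a JSON string."""
    # literal_eval only accepts Python literals, so it does not
    # execute arbitrary code the way eval() would.
    obj = ast.literal_eval(logged_text)
    return json.dumps(obj, indent=2)
```

The result uses double quotes and lowercase `true`/`false`, so it can be saved to a file and passed to a script or to curl.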

Mike Conway

Apr 8, 2026, 5:57:53 PM
to dataverse...@googlegroups.com
Ahh sure! I'll set this up first thing!

Good to hear from you.

M

Philip Durbin

Apr 9, 2026, 12:46:49 PM
to dataverse...@googlegroups.com
Hi Mike,

That 202 response is very likely specific to Harvard Dataverse and the AWS Elastic Load Balancer (ELB) we use. Given that, can you please email sup...@dataverse.harvard.edu so we can track it with the ticketing system for Harvard Dataverse?

Thanks!

Phil

Mike Conway

Apr 9, 2026, 12:52:15 PM
to dataverse...@googlegroups.com
Absolutely, and thanks for all the great support.
