Dataverse is open-source research data repository software that has long supported distributed metadata and increasingly supports distributed data.
DataLad-Registry is a service that maintains up-to-date information on more than ten thousand datasets, a collection that continues to grow.