TAUS Launches the Data Marketplace

The first and only language data monetization and acquisition platform.

TAUS Data Marketplace

TAUS has launched the first release of the Data Marketplace, a platform for all stakeholders in the global language and AI industries to trade, clean, cluster and curate language data. The Data Marketplace is a collaborative project between TAUS, Translated, and FBK Trento, co-financed by the European Union under the Connecting Europe Facility program. Its aim is to disintermediate the language data supply chain by connecting data producers and consumers directly.

The Data Marketplace allows owners and producers of data to monetize their data, and allows trainers and developers of MT and AI systems to shop for high-quality data tuned to their required domains and specifications. The Data Marketplace currently offers data cleaning and anonymization services. A Matching Data, or clustered search, feature that lets users build their own corpora tuned to their domains and needs will be part of the next release in June 2021.

"The Data Marketplace establishes a more equal level playing field for the thousands of companies in the world that want to optimize automatic translation and invest in language-based AI," says Jaap van der Meer, Director of TAUS. "Today access to sufficient language data is precluded to a few big-tech companies who can afford to invest. The Data Marketplace makes access to language data universal and affordable by connecting language data producers and consumers directly."

Sellers on the Data Marketplace include publishing companies, data companies, language service providers and buyers, and translators looking for ways to monetize the language data they have collected or generated over time.

"The Data Marketplace comes just at the right time to support the grand shifts in the translation and AI industries and to realize the best results out of the investments in translation automation," says Marco Trombetti, CEO at Translated. "The Data Marketplace is a great opportunity for translators to monetize their resources as well as a step forward for MT players in support of language expansion and domain diversification."

The Data Marketplace launches with the largest collection of language data (more than 35 billion words in 600+ language pairs), helping users of MT and AI expand quickly into new languages, new domains and new applications.

"It had never occurred to me that my hard-earned solutions to thorny translation issues could one day be made available to the public, thus allowing me to share useful knowledge to colleagues, and also offering me some extra reward for my past efforts," says Nicoletta Aresca, Translator and seller on the Data Marketplace.

The Data Marketplace is built on a strong legal framework that complies with privacy policies in Europe and North America. Accordingly, the legal review committee places great emphasis on transparency about the origin and usage of language data.

Learn more about the Data Marketplace

Source: TAUS

About TAUS

TAUS is the language data network offering the largest industry-shared repository of data, deep know-how in language engineering and a network of Human Language Project workers around the globe.

Danzigerkade 65A
Amsterdam, Noord-Holland
1013 AP