Multilingual masked language modeling involves training an AI model on text from several languages, and it’s a technique that’s been used to great effect. In March, a team introduced an architecture that can jointly learn sentence representations for 93 languages belonging to more than 30 different families. But most previous work in language modeling has investigated cross-lingual […]
I’m impressed by Apple’s new Research app for the iPhone and Apple Watch. It’s beautifully designed, ambitious in scope, and so intimately tied into the iOS and watchOS operating systems that it could serve as a singular example of Apple’s “seamless integration of hardware, software, and services.”
What’s a data scientist to do if they lack sufficient data to train a machine learning model? One potential avenue is synthetic data generation, which researchers at IBM Research advocate in a newly published preprint paper. They used a pretrained machine learning model to artificially synthesize new labeled data for text classification tasks. They claim that […]
GitHub today launched the GitHub Security Lab, an ongoing effort to protect open source code projects. The GitHub Security Lab is aimed at bringing together security researchers from partner organizations like Google, Microsoft, Mozilla, Oracle, Uber, and HackerOne.
Ever wonder how to pronounce a particularly challenging word? You’re in luck. Google announced that it’s rolling out a tool in mobile Search that allows you to practice pronunciations without leaving the results page. Coinciding with its debut, the tech giant this morning added images to its Search dictionary and translation features intended to help […]