Facebook’s ‘Rosetta’ system helps the company understand memes

Memes are the language of the web and Facebook wants to better understand them.

Facebook’s AI teams have made substantial advances over the years in both computer vision and natural language recognition. Today, they’ve announced some of their latest work that works to combine advances in the two fields. A new system, codenamed “Rosetta,” helps teams at Facebook and Instagram identify text within images to better understand what their subject is and more easily classify them for search or to flag abusive content.

It’s not all memes, the tool scans over a billion images and video frames daily across multiple languages in real time, according to a company blog post.

Rosetta makes use of recent advances in optical character recognition (OCR) to first scan an image and detect text that is present, at which point the characters are placed inside a bounding box that is then analyzed by convolutional neural nets that try to recognize what’s being communicated.

via Facebook

Facebook has plenty of reasons to be interested in the text that are accompanying videos or photos, particularly in regards to content moderation.

While identifying spam is pretty straight forward when the text description of a photo is “Bruh!!! 🤣🤣🤣” or “1 like = 1 prayer,” but videos and photos that employ similar techniques seemed to be more present in timelines as Facebook tweaks its algorithm to promote “time well spent.” The same goes for hate speech which can much more easily be shared when all the messaging is encapsulated in one image or video which makes text overlays a useful tool.

The company says that this system presents new challenges for the company in terms of multi-language support as it’s currently running off a unified model for languages and the bulk of available training data is currently in the Latin alphabet. In the company’s research paper, the team details that it has some strategies to conjure up new language support by repurposing existing databases.

As Facebook looks to offload work from human content moderators and allow its news feed algorithms to sort content based on assigned classifications, a tool like this has a lot of potential to shape how Facebook identifies harmful content but also put more interesting content in front of you.



from www.tech-life.in
Share:

Related Posts:

No comments:

Post a Comment

Search This Blog

Blog Archive

Powered by Blogger.

Edo raises $12M from Breyer Capital to measure TV ad effectiveness

Edo , an ad analytics startup founded by Daniel Nadler and actor Edward Norton, announced today that it has raised $12 million in Series A f...

Unordered List

  • Lorem ipsum dolor sit amet, consectetuer adipiscing elit.
  • Aliquam tincidunt mauris eu risus.
  • Vestibulum auctor dapibus neque.

Sample Text

Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation test link ullamco laboris nisi ut aliquip ex ea commodo consequat.

Pages

Theme Support

Need our help to upload or customize this blogger template? Contact me with details about the theme customization you need.