
Musidata

the official Rondo DB blog

Tag: enriching

Matching Entities, a Generic Algorithm

23 July 2019 / 24 July 2019, by Mickaël A

In music, as in any data-related field, enriching data from an external source is a common yet tedious task. External data can come from open datasets such as Wikipedia, from web crawling (if legal), or from any other data source. The main idea is that you hold your own dataset locally, public or not, and you want to enrich it with rich public data. In music, MusicBrainz and Discogs publish such datasets, which can be used for matching. As I recently worked with the RISM dataset (Répertoire International des Sources Musicales), I will use it as an example to match person entities.

However, the algorithm I’ll build for you is source-, field- and language-agnostic.
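To make "agnostic" a bit more concrete, here is a minimal sketch of what such matching could look like. It is not the algorithm from the full post: the function names (`normalize`, `similarity`, `best_match`), the record layout (plain dictionaries) and the 0.85 threshold are all assumptions chosen for illustration. Records are compared on whatever list of fields you pass in, so nothing in the code is tied to a given source or to person entities.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(value: str) -> str:
    """Lowercase, strip accents and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", value)
    without_accents = "".join(
        c for c in decomposed if not unicodedata.combining(c)
    )
    return " ".join(without_accents.lower().split())


def similarity(local: dict, external: dict, fields: list) -> float:
    """Average string similarity over the fields present in both records."""
    scores = []
    for field in fields:
        a, b = local.get(field), external.get(field)
        if a and b:
            ratio = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
            scores.append(ratio)
    return sum(scores) / len(scores) if scores else 0.0


def best_match(local: dict, candidates: list, fields: list,
               threshold: float = 0.85):
    """Return the best-scoring candidate at or above the threshold, else None.

    The threshold is an arbitrary illustrative value, not a recommendation.
    """
    scored = [(similarity(local, c, fields), c) for c in candidates]
    if not scored:
        return None
    score, candidate = max(scored, key=lambda pair: pair[0])
    return candidate if score >= threshold else None


# Hypothetical records: the field names are made up for this example.
local_person = {"name": "Frédéric  Chopin", "birth": "1810"}
external_people = [
    {"name": "Frederic Chopin", "birth": "1810"},
    {"name": "Franz Liszt", "birth": "1811"},
]
match = best_match(local_person, external_people, ["name", "birth"])
```

Because the field list is just a parameter, the same functions could compare titles, places or dates. A real matcher would likely weight fields differently and handle name variants, which this sketch does not attempt.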


Tagged: enriching, matching, metadata
