By Arnaud Hillen on March 1st, 2022
Segments.ai has great labeling tools for computer vision. As multimodal learning is becoming increasingly important, even computer vision teams sometimes need to label other data like text or audio. To support the labeling needs of our users beyond computer vision, we added two new interfaces for text labeling: named entity recognition and span categorization.
Named entity recognition is the task of locating and classifying words and phrases into non-overlapping categories such as names, organizations, locations, etc. Each word can have one category. For example, the sentence “James bought 30 shares of Apple in 2020.” contains the following named entities:
When words can have multiple overlapping labels, you can use our more general span categorization interface. For example, when we also want to label grammar and parts of speech, the annotated sentence now becomes:
To start labeling data with our text interfaces:
Let us know what you think of our text interfaces so we can make them even better in the future! And don’t hesitate to contact us if you have any questions.