Label Studio Release Notes

Brand new PIP package, NPM package for the embeds, and Pairwise

Hello there!

You’re reading the newsletter for a Label Studio release. Labeling and annotating data is a tedious process, and we know that. In the last two weeks of the year, we decided to do a retrospective and figure out how to make it less so.

Some time ago, we added support for machine-learning-assisted labeling, where you can use different models to help you annotate a dataset semi-automatically. But it was not easy to get up and running. With this release, creating and managing new projects becomes much easier.

However, there is a fine line between simplicity and feature-richness. We’re still trying to find it and would like to ask for your feedback on this release.

Simplifying the usage

With the release of the PIP package, it’s never been easier to get up and running. These three commands are everything you need to get the server running (note that Python 3.6 or later is required):

pip install label-studio

label-studio init labeling_project

label-studio start labeling_project
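Since the package needs Python 3.6 or later, it can be handy to check the interpreter before installing. The snippet below is a minimal sketch of such a check; the function name `python_is_supported` is only illustrative and is not part of Label Studio.

```python
import sys

# Minimum version taken from the note above (Python 3.6 or later).
MIN_PYTHON = (3, 6)


def python_is_supported(version_info=None, minimum=MIN_PYTHON):
    """Return True if the given (or current) interpreter meets the minimum.

    `version_info` can be any sequence such as (3, 8, 10); when omitted,
    the running interpreter's version is used.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only the (major, minor) pair.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if python_is_supported():
        print("Python version is fine, go ahead with: pip install label-studio")
    else:
        print("Python 3.6 or later is required")
```

Run it with the same interpreter you plan to use for `pip install`, so the check reflects the right environment.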

After the server starts, you get a new interface with support for multi-format imports and a live preview of the labeling interface you’re creating.

Pairwise Comparison

Pairwise comparison means comparing entities in pairs to judge which one is preferred, which has more of some quantitative property, or whether the two entities are identical. You can compare any object type that Label Studio supports: texts, images, audio, or plain HTML.

Here you can see the comparison between two HTML blocks displaying dialogues.
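To give a feel for what you can do with pairwise judgments once they are collected, here is a small sketch that counts how often each item was preferred. The `(left_id, right_id, choice)` tuple layout is an assumption made for this example only; it is not Label Studio's export format.

```python
from collections import Counter


def tally_preferences(judgments):
    """Count how many times each item was preferred.

    `judgments` is an iterable of (left_id, right_id, choice) tuples,
    where choice is "left" or "right". This layout is hypothetical and
    chosen only for illustration.
    """
    wins = Counter()
    for left_id, right_id, choice in judgments:
        if choice == "left":
            wins[left_id] += 1
        elif choice == "right":
            wins[right_id] += 1
        else:
            raise ValueError("choice must be 'left' or 'right', got %r" % (choice,))
    return wins


if __name__ == "__main__":
    example = [
        ("dialogue_a", "dialogue_b", "left"),
        ("dialogue_a", "dialogue_c", "left"),
        ("dialogue_b", "dialogue_c", "right"),
    ]
    # Print items from most to least preferred.
    for item, count in tally_preferences(example).most_common():
        print(item, count)
```

A plain win count is the simplest possible summary; it is enough to rank a handful of items, but it does not account for which opponents each item faced.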

Separating the frontend

Along with releasing the PIP package, the frontend moves into its own repository and a neat NPM package. If you’re looking to embed annotation into your application, all you have to do is run:

npm install label-studio

There are also docs on how to run it in development mode and extend it further by adding custom annotation types, for example, videos or time series.

Till next time

We’ve postponed some features from the last release to concentrate on ease of use, but they are in the pipeline, and we will introduce them gradually here and on our Slack channel. Here is what’s coming:

  • Image Segmentation with Brushes, Livewire and Floodfill

  • Transformers model connector for the assisted labeling

Visit GitHub

That’s all we’ve got so far. I hope you had a great Christmas and New Year celebration. We hope to hear back from you.

Till next time!

Michael, Label Studio team

HTML Annotation, Grid View, Annotation and Labeling tool

Hello there!

This is the first Label Studio Release newsletter. Since we released the tool as open source around four months ago, it has received great feedback from the community, valuable suggestions on where to focus our engineering efforts, and, of course, bug reports.

As a reminder, in case you subscribed some time ago and don't remember what our tool is about: Label Studio is a multi-type data labeling and annotation tool. It's built with simplicity of interface in mind, and the look is fully configurable. You build the tool yourself in 10-20 lines of the config language. As a side effect, it can also be embedded in your applications. Right now, it supports images, text, audio, and HTML, with the goal of covering all domains.

Now to the features of this latest (0.3.0) release and the plans for the next releases.

HTML Annotation

A new annotation type! HTML is the most fundamental building block of the web and the de facto standard for creating webpages. But what makes HTML annotation really powerful is how many other formats can be converted into it. PDFs are one example: now you can annotate things like resumes, research, and legal papers. Excel sheets, which are used for invoices and research papers, are another. All of those can be converted into HTML and annotated. This gives you a lot of freedom with new data types, without losing the visual representation as you would with plain text.

It is still in its early days, but functional enough to annotate data that comes, for example, from Excel sheets. In the animation above, you can see an invoice exported as HTML being annotated.

Grid View - Compare annotations side by side

We've often seen a scenario where you want to explore and compare the results created by different annotators. To make this easier, we are introducing the grid view. You get all the results on one page and can compare them side by side. This way, you immediately see the differences between the annotations and can explore them further.

Here you can quickly spot how different the audio segmentations are.

Coming soon

  • Image Segmentation with Brushes, Livewire and Floodfill

  • Transformers model connector for the assisted labeling

I guess this is it for now. We’d love to hear from you and get feedback on our first newsletter: what is missing, and what can be improved to make it more engaging and interesting for you? Thanks for reading, enjoy your Christmas, and create happy memories!

Till next time!

Michael, Label Studio team

