BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages

BlaBla

At the heart of BlaBla is the DocumentProcessor and the Document class. You have to import the DocumentProcessor class to process a piece of input text as shown in the below piece of code.

from bla_bla.document_processor import DocumentProcessor
with DocumentProcessor("stanza_config/stanza_config.yaml", "en") as doc_proc:
    content = "The picture shows a boy walking to the kitchen to pick a cookie from the cookie jar."
    doc = doc_proc.analyze(content, "string")
    res_json = doc.compute_features("noun_rate")
    print(res_json)

Under the hood, the DocumentProcessor object has an analyze method that will return an object of type Document class which can be used to compute features

Features Table