BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages¶
BlaBla¶
At the heart of BlaBla is the DocumentProcessor
and the Document
class. You have to import the DocumentProcessor
class to process a piece of input text as shown in the below piece of code.
from bla_bla.document_processor import DocumentProcessor
with DocumentProcessor("stanza_config/stanza_config.yaml", "en") as doc_proc:
content = "The picture shows a boy walking to the kitchen to pick a cookie from the cookie jar."
doc = doc_proc.analyze(content, "string")
res_json = doc.compute_features("noun_rate")
print(res_json)
Under the hood, the DocumentProcessor
object has an analyze
method that will return an object of type Document
class which can be used to compute features