Jump to content
Wikipedia The Free Encyclopedia

Apache cTAKES

From Wikipedia, the free encyclopedia
Natural language processing system
Apache cTAKES
Apache cTAKES Logo
Developer Apache Software Foundation
Stable release
6.0.0 / September 16, 2024; 14 months ago (2024年09月16日)
Repository cTakes Repository
Written inJava, Scala, Python
Operating system Cross-platform
Type Natural language processing, Bioinformatics, Text mining, Information Extraction
License Apache License 2.0
WebsiteOfficial website Edit this at Wikidata

Apache cTAKES: clinical Text Analysis and Knowledge Extraction System is an open-source Natural Language Processing (NLP) system that extracts clinical information from electronic health record unstructured text. It processes clinical notes, identifying types of clinical named entities — drugs, diseases/disorders, signs/symptoms, anatomical sites and procedures. Each named entity has attributes for the text span, the ontology mapping code, context (family history of, current, unrelated to patient), and negated/not negated.[1]

cTAKES was built using the UIMA Unstructured Information Management Architecture framework and OpenNLP natural language processing toolkit.[2] [3]

Components

[edit ]

Components of cTAKES are specifically trained for the clinical domain, and create rich linguistic and semantic annotations that can be utilized by clinical decision support systems and clinical research.[4]

These components include:

  • Named Section identifier
  • Sentence boundary detector
  • Rule-based tokenizer
  • Formatted list identifier
  • Normalizer
  • Context dependent tokenizer
  • Part-of-speech tagger
  • Phrasal chunker
  • Dictionary lookup annotator
  • Context annotator
  • Negation detector
  • Uncertainty detector
  • Subject detector
  • Dependency parser
  • patient smoking status identifier
  • Drug mention annotator

History

[edit ]

Development of cTAKES began at the Mayo Clinic in 2006. The development team, led by Dr. Guergana Savova and Dr. Christopher Chute, included physicians, computer scientists and software engineers. After its deployment, cTAKES became an integral part of Mayo's clinical data management infrastructure, processing more than 80 million clinical notes.[5]

When Dr. Savova's moved to Boston Children's Hospital in early 2010, the core development team grew to include members there. Further external collaborations include:[5]

Such collaborations have extended cTAKES' capabilities into other areas such as Temporal Reasoning, Clinical Question Answering, and coreference resolution for the clinical domain.[5]

In 2010, cTAKES was adopted by the i2b2 program and is a central component of the SHARP Area 4.[5]

In 2013, cTAKES released their first release as an Apache Software Foundation incubator project: cTAKES 3.0.[citation needed ]

In March 2013, cTAKES became an Apache Software Foundation Top Level Project (TLP).[5]

See also

[edit ]

References

[edit ]
  1. ^ Denecke, Kerstin (2015年08月31日). "Tools and Resources for Information Extraction". Health Web Science: Social Media Data for Healthcare. Springer. p. 67. ISBN 978-3-319-20582-3 – via Google Books.
  2. ^ Khalifa, Abdulrahman; Meystre, Stéphane (2015年12月01日). "Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes". Journal of Biomedical Informatics. Proceedings of the 2014 i2b2/UTHealth Shared-Tasks and Workshop on Challenges in Natural Language Processing for Clinical Data. 58 (Supplement): S128 – S132. doi:10.1016/j.jbi.201508002. PMC 4983192 . PMID 26318122.
  3. ^ Khudairi, Sally (2017年04月25日). "The Apache Software Foundation Announces Apache® cTAKESTM v4.0" (Press release). Forest Hill, MD: The Apache Software Foundation. Globe Newswire. Retrieved 2017年09月20日.
  4. ^ Savova, Guergana K; Masanz, James J; Ogren, Philip V; Zheng, Jiaping; Sohn, Sunghwan; Kipper-Schuler, Karin C; Chute, Christopher G (2010). "Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications". Journal of the American Medical Informatics Association. 17 (5): 507–513. doi:10.1136/jamia.2009.001560. ISSN 1067-5027. PMC 2995668 . PMID 20819853.
  5. ^ a b c d e "History". Apache cTAKESTM - clinical Text Analysis Knowledge Extraction System. 2015年06月22日. Retrieved 2018年01月11日.
[edit ]
Top-level
projects
Commons
Incubator
Other projects
Attic
Licenses
Health software
Barcoding
Databases
Diagnostics
Bioimaging
DICOM
General
Servers
Heuristics
Odontologic
Electronic
health records
Platforms
Terminology
Laboratory
management
Patient portals
Practice
management
Comprehensive
Specialty
Scheduling
Patient engagement
Research
Surgical
Assistive
Transmission
Related

AltStyle によって変換されたページ (->オリジナル) /