UIMA
UIMA (/juˈiːmə/ yoo-EE-mə),[1] short for Unstructured Information Management Architecture, is an OASIS standard[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.
Structure
The UIMA architecture can be thought of in four dimensions:
- It specifies component interfaces in an analytics pipeline.
- It describes a set of design patterns.
- It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
- It suggests development roles allowing tools to be used by users with diverse skills.
Implementations and uses
Developer(s) | IBM, Apache Software Foundation (since October 2006) |
---|---|
Stable release | 3.1.1 / November 8, 2019; 4 years ago (2019-11-08)[3] |
Repository |
|
Written in | Java with C++ enablement |
Operating system | cross-platform |
Type | text mining, information extraction |
License | Apache License 2.0 |
Website | uima |
Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.
UIMA is used in a number of software projects:
- IBM Research's Watson uses UIMA for analyzing unstructured data.[4]
- The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
- DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.
See also
- Data Discovery and Query Builder
- Entity extraction
- General Architecture for Text Engineering (GATE)
- IBM Omnifind
- LanguageWare
References
- ^ UIMA Frequently Asked Questions (FAQ's) The Apache Software Foundation
- ^ UIMA Specification The Apache Software Foundation.
- ^ "Apache UIMA - News". uima.apache.org. Retrieved 11 December 2019.
- ^ "Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog". blogs.apache.org. 14 February 2011. Retrieved 23 April 2018.
External links
- Apache UIMA home page
- v
- t
- e
projects
- Accumulo
- ActiveMQ
- Airavata
- Airflow
- Allura
- Ambari
- Ant
- Aries
- Arrow
- Apache HTTP Server
- APR
- Avro
- Axis
- Axis2
- Beam
- Bloodhound
- Brooklyn
- Calcite
- Camel
- CarbonData
- Cassandra
- Cayenne
- CloudStack
- Cocoon
- Cordova
- CouchDB
- cTAKES
- CXF
- Derby
- Directory
- Drill
- Druid
- Empire-db
- Felix
- Flex
- Flink
- Flume
- FreeMarker
- Geronimo
- Groovy
- Guacamole
- Gump
- Hadoop
- HBase
- Helix
- Hive
- Iceberg
- Ignite
- Impala
- Jackrabbit
- James
- Jena
- JMeter
- Kafka
- Kudu
- Kylin
- Lucene
- Mahout
- Maven
- MINA
- mod_perl
- MyFaces
- Mynewt
- NiFi
- NetBeans
- Nutch
- NuttX
- OFBiz
- Oozie
- OpenEJB
- OpenJPA
- OpenNLP
- OрenOffice
- ORC
- PDFBox
- Parquet
- Phoenix
- POI
- Pig
- Pinot
- Pivot
- Qpid
- Roller
- RocketMQ
- Samza
- Shiro
- SINGA
- Sling
- Solr
- Spark
- Storm
- SpamAssassin
- Struts 1
- Struts 2
- Subversion
- Superset
- SystemDS
- Tapestry
- Thrift
- Tika
- TinkerPop
- Tomcat
- Trafodion
- Traffic Server
- UIMA
- Velocity
- Wicket
- Xalan
- Xerces
- XMLBeans
- Yetus
- ZooKeeper
- Category