Subject: [jira] [Commented] (ORC-42) Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA
From: Dinesh S. Atreya (JIRA) (ji@apache.org)
Date: Feb 14, 2016 8:34:48 pm
List: org.apache.orc.issues

[
https://issues.apache.org/jira/browse/ORC-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15146928#comment-15146928
]

Dinesh S. Atreya commented on ORC-42:

-------------------------------------

{panel:title=INDEX_ORC} _*Once comprehensive index-processing capabilities are added to ORC, i.e., to Hadoop,
they can be used to build indexes on other types of files in Hadoop.*_

Some candidate index types are given below:
* Binary-Tree
* B-Tree
* B+-Tree
* Bit-Map
* Search Indexes

Search engines such as Solr, Elasticsearch, etc. can use these index-processing
capabilities. {panel}
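
To make the Bit-Map candidate above concrete, here is a minimal sketch of a bitmap index over two columns. It is not ORC code and does not reflect any existing ORC or Hadoop API; the function names (`build_bitmap_index`, `rows_of`) and the sample columns are hypothetical, chosen only to illustrate the idea that each distinct value maps to a bit vector of the rows containing it.

```python
from collections import defaultdict


def build_bitmap_index(values):
    """Map each distinct value to an int whose bit i is set when row i holds that value."""
    index = defaultdict(int)
    for row, value in enumerate(values):
        index[value] |= 1 << row
    return dict(index)


def rows_of(bitmap):
    """Decode a bitmap back into an ascending list of row numbers."""
    rows = []
    row = 0
    while bitmap:
        if bitmap & 1:
            rows.append(row)
        bitmap >>= 1
        row += 1
    return rows


# Hypothetical columns, one entry per row of some file (illustrative data only).
country = ["US", "IN", "US", "DE", "IN", "US"]
status = ["open", "open", "closed", "open", "closed", "open"]

country_idx = build_bitmap_index(country)
status_idx = build_bitmap_index(status)

# country == "US" AND status == "open" is a bitwise AND of two bitmaps.
us_open = rows_of(country_idx["US"] & status_idx["open"])

# country == "IN" OR country == "DE" is a bitwise OR of two bitmaps.
in_or_de = rows_of(country_idx["IN"] | country_idx["DE"])
```

Combining predicates reduces to bitwise operations on the per-value bitmaps, which is why bitmap indexes tend to suit low-cardinality columns. A real implementation would likely use compressed bitmaps rather than plain integers.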

Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA

---------------------------------------------------------------

Key: ORC-42
URL: https://issues.apache.org/jira/browse/ORC-42
Project: Orc
Issue Type: New Feature
Reporter: Dinesh S. Atreya

Link to Umbrella JIRA: https://issues.apache.org/jira/browse/HADOOP-12620
See
https://issues.apache.org/jira/browse/HADOOP-12620?focusedCommentId=15046300&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15046300
for more details.

This JIRA is an umbrella (parent/master) JIRA for advancing ORC given
https://issues.apache.org/jira/browse/HDFS-9607. Capabilities that can be added to ORC once HDFS update is supported
may include:

JSON_ORC -- native processing of JSON (add MongoDB/CouchDB-type capabilities in Hadoop)
XML_ORC -- add native XML processing capability to ORC
RDF_ORC -- native processing of RDF documents
MVCC_ORC -- add Multi-Version Concurrency Control (MVCC) support to ORC
INDEX_ORC -- create a variety of indexes, such as B-Tree and Bitmap, for other files in Hadoop