Apache Hive : HiveContributorsMinutes100601
Notes provided by Namit Jain.
The following people were present:
- Facebook (Paul Yang; Ning Zhang; Yongqiang He; Ahmed Aly; John Sichi; Ashish Thusoo; Namit Jain)
- Netflix (Eva Tse; Jerome Boulon)
- Cloudera (Arvind Prabhakar; Vinithra Varadharajan; Carl Steinbach)
- Yahoo (Alan Gates)
The following were the main meeting minutes:
- We should have these meetings more often, say every month. Cloudera will host the next meeting.
- We should try to have a release every 4 months. We should try to push out 0.6 before the end of June. For the new release, Cloudera will take the lead on release management and also help with documentation. Documentation for Hive leaves a lot to be desired.
- The test framework is pretty brittle, and it is difficult for new people to make big contributions without a very sound test plan. Ideally, Facebook should host a test cluster so that everyone can run tests there.
- A lot of external customers are asking for ODBC/JDBC support on top of Hive. Cloudera will take the lead on that.
- The process of making a new committer should be more transparent. In order to grow the community, it would be very desirable to add more committers outside Facebook.
- Create new components for Drivers (ODBC/JDBC) and UDFs.
- Yahoo will take the lead on making Hive work on top of Zebra.
Some new tasks were identified, but they can change if new priorities come in.
- Carl will focus on ‘having’ support and correlated sub-queries.
- Arvind will be focusing on the cost-based optimizer.
The main idea was that we should meet more often and share our ideas. Time-based releases would be very desirable.