FAILED: Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask
I get the error shown above and I don't know why. I think I have the jar file in /usr/lib/hive/lib,
named hive-serde-0.10.0-cdh4.2.1.jar.
create table timesheet
(BibNum bigint,
Title string,
Author string,
ISBN string,
PublicationYear string,
Publisher string,
Subjects string,
ItemType string,
ItemCollection String,
FloatingItem string, ItemLocation string,
ReportDate string, ItemCount int
) row format serde 'com.bizo.hive.serde.csv.CSVSerde'
with serdeproperties ( "seperatorChar" = "\t", "quoteChar" ="'", "escapeChar" = "\\" )
stored as textfile;
This is the command I used. My goal is to learn how to use Hive.
Try downloading a newer VM such as HDP 2.6, or spinning up a Dataproc or EMR instance in the cloud. Datalab has a good number of tutorials.
Honestly, I'd try to learn Spark. Then you can use Databricks Community Edition to learn in your browser.
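If you do go the newer-Hive route, one option is the CSV SerDe bundled with Hive 0.14 and later, org.apache.hadoop.hive.serde2.OpenCSVSerde. As far as I know, the com.bizo class comes from the separate csv-serde project rather than from Hive's own hive-serde jar, so using the bundled class avoids needing that extra jar on the classpath. This is a minimal sketch, not something the poster tried: the table name timesheet_csv is hypothetical, the columns are copied from the question, and every column is declared string because OpenCSVSerde reads all columns as strings. Note that the property key is spelled separatorChar.

```sql
-- Sketch only: assumes Hive 0.14+ where OpenCSVSerde ships with Hive.
-- timesheet_csv is a hypothetical table name; columns mirror the question.
create table timesheet_csv
(BibNum string,
Title string,
Author string,
ISBN string,
PublicationYear string,
Publisher string,
Subjects string,
ItemType string,
ItemCollection string,
FloatingItem string,
ItemLocation string,
ReportDate string,
ItemCount string
) row format serde 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
with serdeproperties (
  "separatorChar" = "\t",
  "quoteChar" = "'",
  "escapeChar" = "\\"
)
stored as textfile;
```

Numeric columns can then be converted at query time, for example with cast(BibNum as bigint).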