Insert streaming data into HAWQ

How can I insert streaming data into HAWQ and run queries on the data while it is still arriving?

First I tested JDBC inserts, and performance was very bad.
Next I tested writing data to HDFS with Flume and created an external table in HAWQ, but HAWQ can't read the data until Flume closes the file. The problem is that if I set the Flume file rolling interval very low (1 min), the number of files grows large after a few days, which is not good for HDFS.
A third option is HBase, but most of my queries are aggregations over large amounts of data, so HBase is not a good fit (HBase is good for fetching single records).

So, given these constraints, what is a good way to query streaming data online with HAWQ?

Solution

If your source data is not on HDFS, you can try gpfdist with a named pipe as a buffer, read through a gpfdist external table, or a web external table backed by other Linux scripts. Another solution is the Spring XD gpfdist module: http://docs.spring.io/spring-xd/docs/1.3.1.RELEASE/reference/html/#gpfdist
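
To make the named pipe idea more concrete, here is a rough Python sketch, not a tested setup. It assumes gpfdist is already running and serving the directory that holds the pipe (for example `gpfdist -d /data/stream -p 8081`). The host `etl1`, the port, the pipe name `events.pipe`, the table `ext_events` and its columns are all made up for illustration.

```python
import os


def external_table_ddl(table, columns, host, port, pipe_name, delimiter="|"):
    """Build the DDL for a readable external table that points at a file
    (here: a named pipe) served by gpfdist. All names are hypothetical."""
    cols = ", ".join(f"{name} {sqltype}" for name, sqltype in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols})\n"
        f"LOCATION ('gpfdist://{host}:{port}/{pipe_name}')\n"
        f"FORMAT 'TEXT' (DELIMITER '{delimiter}');"
    )


def format_rows(rows, delimiter="|"):
    """Turn an iterable of tuples into delimited text, one row per line."""
    return "".join(delimiter.join(str(v) for v in row) + "\n" for row in rows)


def write_batch(pipe_path, rows, delimiter="|"):
    """Write one batch into the named pipe.

    Opening a FIFO for writing blocks until a reader (gpfdist, triggered
    by a query on the external table) opens the other end.
    """
    if not os.path.exists(pipe_path):
        os.mkfifo(pipe_path)  # Linux/Unix only
    with open(pipe_path, "w") as pipe:
        pipe.write(format_rows(rows, delimiter))


if __name__ == "__main__":
    print(external_table_ddl(
        "ext_events",
        [("id", "int"), ("payload", "text")],
        host="etl1", port=8081, pipe_name="events.pipe",
    ))
```

With something like this in place, the streaming side would call `write_batch("/data/stream/events.pipe", rows)` for each micro-batch, and a periodic `INSERT INTO events SELECT * FROM ext_events;` in HAWQ would drain the pipe into a regular table that the aggregation queries run against. The batch size and load interval are things you would have to tune yourself.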
