Quickly test HDFS 2 Source connector.
Simply run:
$ ./hdfs2.sh
Steps from connect-hdfs2-sink
Creating HDFS Source connector:
$ curl -X PUT \
-H "Content-Type: application/json" \
--data '{
"connector.class":"io.confluent.connect.hdfs2.Hdfs2SourceConnector",
"tasks.max":"1",
"hdfs.url":"hdfs://namenode:9000",
"hadoop.conf.dir":"/etc/hadoop/",
"format.class" : "io.confluent.connect.hdfs2.format.avro.AvroFormat",
"confluent.topic.bootstrap.servers": "broker:9092",
"confluent.topic.replication.factor": "1",
"transforms" : "AddPrefix",
"transforms.AddPrefix.type" : "org.apache.kafka.connect.transforms.RegexRouter",
"transforms.AddPrefix.regex" : ".*",
"transforms.AddPrefix.replacement" : "copy_of_$0"
}' \
http://localhost:8083/connectors/hdfs2-source/config | jq .
Verifying topic copy_of_test_hdfs
:
$ docker exec connect kafka-avro-console-consumer -bootstrap-server broker:9092 --property schema.registry.url=http://schema-registry:8081 --topic copy_of_test_hdfs --from-beginning --max-messages 9
Results:
{"f1":"value1"}
{"f1":"value2"}
{"f1":"value3"}
{"f1":"value4"}
{"f1":"value5"}
{"f1":"value6"}
{"f1":"value7"}
{"f1":"value8"}
{"f1":"value9"}
N.B: Control Center is reachable at http://127.0.0.1:9021