Welcome to your new Kafka Connect connector!

Kafka_connector

Generate a connector (Source Kafka Connector)

Setup:

  1. JDK version 1.8
  2. Generate a new project in IntelliJ IDEA from a Maven archetype.
  3. Configure:
     1. GroupID: io.confluent.maven
     2. ArtifactID: kafka-connect-quickstart
     3. Version: 0.10.2.1-cp1
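
For readers who do not use IntelliJ IDEA, the same project can probably be generated from the command line with the Maven Archetype Plugin. The sketch below assumes the coordinates listed above identify the archetype itself; the project coordinates (`com.example`, `my-source-connector`, `1.0-SNAPSHOT`) and the `RUN_MVN` switch are hypothetical names introduced only for illustration. By default the script only prints the command, because actually running it needs Maven and network access to a repository that hosts this archetype version.

```shell
#!/bin/sh
# Archetype coordinates copied from the setup list above
# (assumption: they name the archetype, not the generated project).
ARCHETYPE_GROUP_ID="io.confluent.maven"
ARCHETYPE_ARTIFACT_ID="kafka-connect-quickstart"
ARCHETYPE_VERSION="0.10.2.1-cp1"

# Hypothetical coordinates for the generated project; replace with your own.
PROJECT_GROUP_ID="com.example"
PROJECT_ARTIFACT_ID="my-source-connector"
PROJECT_VERSION="1.0-SNAPSHOT"

# Assemble the command as positional parameters so each argument stays intact.
# -B runs Maven in batch (non-interactive) mode.
set -- mvn archetype:generate -B \
  "-DarchetypeGroupId=$ARCHETYPE_GROUP_ID" \
  "-DarchetypeArtifactId=$ARCHETYPE_ARTIFACT_ID" \
  "-DarchetypeVersion=$ARCHETYPE_VERSION" \
  "-DgroupId=$PROJECT_GROUP_ID" \
  "-DartifactId=$PROJECT_ARTIFACT_ID" \
  "-Dversion=$PROJECT_VERSION"

generate_cmd="$*"
echo "$generate_cmd"

# Dry run by default: Maven is only invoked when RUN_MVN=1 is set.
if [ "${RUN_MVN:-0}" = "1" ]; then
  "$@"
fi
```

Run it once as is to inspect the command, then again with `RUN_MVN=1` to generate the project.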

How to run

Building packages:

  mvn clean package
  mvn assembly:single  # check pom.xml for assembly & jar package

Configuration:

  1. Export the classpath:
     export CLASSPATH="$(find target -type f -name '*.jar' | grep '\-package' | tr '\n' ':')"
  2. Build the Docker image:
     docker build . -t quangtn/kafka-connect-source-github:1.0
  3. Run the container:
     docker run -e CLASSPATH="$CLASSPATH" --net=host --rm -t -v "$(pwd)/offsets:/kafka-connect-source-github/offsets" quangtn/kafka-connect-source-github:1.0
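
To see what the classpath export step produces, here is a minimal sketch that runs the same kind of pipeline against a throwaway directory. The helper name `build_classpath` and the `demo-1.0` file names are made up for the demonstration and are not the real build output. It uses `paste -sd:` instead of `tr` so the result has no trailing colon, and `sort` only to make the order predictable.

```shell
#!/bin/sh
# Hypothetical helper: list every jar under the given directory whose path
# contains "-package" and join the paths with ':'.
build_classpath() {
  find "$1" -type f -name '*.jar' | grep -e '-package' | sort | paste -sd: -
}

# Build a small fake tree that mimics an assembly output directory.
demo_dir="$(mktemp -d)"
mkdir -p "$demo_dir/target/demo-1.0-package/lib"
touch "$demo_dir/target/demo-1.0.jar" \
      "$demo_dir/target/demo-1.0-package/lib/a.jar" \
      "$demo_dir/target/demo-1.0-package/lib/b.jar"

# Run from inside the demo directory so the paths stay relative to target/.
demo_classpath="$(cd "$demo_dir" && build_classpath target)"
echo "$demo_classpath"

# Clean up the throwaway tree.
rm -rf "$demo_dir"
```

Only the jars under the `-package` directory end up on the classpath; the top-level `demo-1.0.jar` is filtered out by `grep`.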

Note:

The custom sink connector will be added in a separate project. A sink connector can also be used with Elasticsearch or a database.

Architecture

Kafka Connector architecture

kafka_connector_architecture

Kafka Connect Cluster Distributed Architecture

distributed_architecture

Data Pipeline

data_pipeline_AWS