软件 | 版本 | 依赖 |
---|---|---|
Knox | 1.1.0 | |
Ranger | 2.0.0 | MySQL, Solr |
MySQL(mariadb) | 5.5.65 | |
Solr | 8.6.3 | ZooKeeper |
ZooKeeper | 3.6.2 | |
Hadoop | 3.1.4 | |
Hive | 3.1.2 | Hadoop |
HBase | 2.2.6 | Hadoop, ZooKeeper |
Phoenix | Hbase | |
Spark | 2.4.7 | Hadoop |
Ambari | 2.7.5 | |
ealsticsearch | 7.9.2 | |
Kibana | 7.9.2 | elasticsearch |
Clickhouse | 20.12.3.3 | ZooKeeper |
Kafka | kafka_2.12-2.7.0 | ZooKeeper |
Waterdrop | 2.0.4 | Spark |
文档:
-
Hadoop: https://hadoop.apache.org/docs/r3.1.4/hadoop-project-dist/hadoop-common/ClusterSetup.html
-
Knonx: http://knox.apache.org/books/knox-1-1-0/user-guide.html
-
Hive:
-
Hbase: https://hbase.apache.org/book.html#quickstart_fully_distributed
-
ZooKeeper: https://zookeeper.apache.org/doc/r3.6.2/zookeeperStarted.html
主机IP和主机名:
10.0.0.11 centos01
10.0.0.12 centos02
10.0.0.13 centos03
10.0.0.14 centos04
部署目录:
/home/servers
部署用户:hadoop、root
以上4台机器与外网不通,需要使用自己的电脑网卡,或者独立的无线网卡,连接以下任意一个网络(该网络是4台机器连接的无线路由):
- TP-LINK_598F
- TP-LINK_5G_598F
网卡使用tenda U12无线网卡,网卡驱动下载地址:
组件 | 启动用户 | 依赖 | centos01 | centos02 | centos03 | centos04 |
---|---|---|---|---|---|---|
HDFS | hadoop | NameNode | DataNode、SecondaryNameNode | DataNode | DataNode | |
YARN | hadoop | ResourceManager | DataManager | DataManager | DataManager | |
YARN JobHistory | hadoop | JobHistoryServer | JobHistoryServer | JobHistoryServer | JobHistoryServer | |
Hive | hadoop | MySQL | metastore、hiveserver2 | |||
ZooKeeper | hadoop | zk | zk | zk | ||
HBase | hadoop | ZooKeeper | HMaster | HRegionServer | HRegionServer | HRegionServer |
Spark | hadoop | YARN、 metastore | thriftserver | |||
Spark HistoryServer | hadoop | HistoryServer | ||||
MySQL | root | mysql server | ||||
Solr | root | ZooKeeper | solr | solr | solr | |
Knox | hadoop | knox | ||||
Ranger | root | MySQL、Solr | ranger-admin、ranger-usersync | |||
elasticsearch | hadoop | master | data | data | data | |
Kibana | hadoop | Kibana | ||||
Kafka | root | ZooKeeper | kafka broker |
注:
- 安装ranger的各种plugin时也要使用root用户,即enable-xxx-plugin.sh、disable-xxx-plugin.sh脚本用root执行。
- jdbc:mysql://centos01:3306 hive数据库的用户名/密码:hive/2020root
gateway登录密码:admin/admin-password
- UI:http://centos01:6080 用户名/密码:admin/2020root
- UI:http://10.0.0.11:9870/
- Namenode:hdfs://centos01:8020
- WebHDFS:http://centos01:9870/webhdfs
- UI:http://10.0.0.11:8088/cluster
- JobHistory UI:http://10.0.0.11:19888/jobhistory
-
metastore:thrift://centos01:9083
-
warehouse(在HDFS上):http://10.0.0.11:8020/user/hive/warehouse
-
hiveserver2 thrift:thrift://centos01:10000
-
hiveserver2 http(未启用):http://centos01:10001
-
hiveserver2 UI:http://10.0.0.11:10002
- History Server UI:http://centos01:18080
- thriftserver:thrift://centos01:10003
- WebHBase:http://centos01:60080
- centos01:9092