GlusterFS shrinking, expansion, and replacement (reposted)

Source: https://www.cnblogs.com/bfmq/p/9990467.html

  • 1. Expansion
[[email protected] ~]# gluster peer probe g3                # add the new node to the cluster; skip this step if the machine is already a member
peer probe: success. Host g3 port 24007 already in peer list        # this machine had already been added
[[email protected] ~]# gluster volume info test                             # the volume currently has 4 bricks

Volume Name: test
Type: Distributed-Replicate
Volume ID: 92ffe586-ea14-4b7b-9b89-5dfd626cb6d4
Status: Started
Snapshot Count: 0
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: g1:/glusterfs/data1
Brick2: g2:/glusterfs/data1
Brick3: g3:/glusterfs/data1
Brick4: g1:/glusterfs/data2
[[email protected] ~]# gluster volume add-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 g3:/glusterfs/data3
volume add-brick: failed: Incorrect number of bricks supplied 5 with count 2        # the brick-count vs. replica-count problem again: bricks must be added in multiples of the replica count, so mind how many disks each server contributes. For example, on a replica-3 volume, if you buy 10 disks, one of them cannot be added
[[email protected] ~]# gluster volume add-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3
volume add-brick: failed: The brick g1:/glusterfs/data3 is being created in the root partition. It is recommended that you don't use the system's root partition for storage backend. Or use 'force' at the end of the command if you want to override this behavior.        # I am still using the / partition, so force is required
[[email protected] ~]# gluster volume add-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 force
volume add-brick: success
[[email protected] ~]# gluster volume info test                 # the volume now has more bricks

Volume Name: test
Type: Distributed-Replicate
Volume ID: 92ffe586-ea14-4b7b-9b89-5dfd626cb6d4
Status: Started
Snapshot Count: 0
Number of Bricks: 4 x 2 = 8
Transport-type: tcp
Bricks:
Brick1: g1:/glusterfs/data1
Brick2: g2:/glusterfs/data1
Brick3: g3:/glusterfs/data1
Brick4: g1:/glusterfs/data2
Brick5: g2:/glusterfs/data2
Brick6: g3:/glusterfs/data2
Brick7: g1:/glusterfs/data3
Brick8: g2:/glusterfs/data3
[[email protected] ~]# gluster volume rebalance test start                # redistribute the existing data evenly across the bricks
volume rebalance: test: success: Rebalance on test has been started successfully. Use rebalance status command to check status of the rebalance process.
ID: a2f4b603-283a-4303-8ad0-84db00adb5a5
[[email protected] ~]# gluster volume rebalance test status            # check the task status; rebalancing takes a while when the files are large
                                    Node Rebalanced-files          size       scanned      failures       skipped               status  run time in h:m:s
                               ---------      -----------   -----------   -----------   -----------   -----------         ------------     --------------
                               localhost                2        0Bytes            10             0             0            completed        0:00:00
                                      g2                1        0Bytes             9             0             0            completed        0:00:00
                                      g3                3        0Bytes             6             0             0            completed        0:00:00
volume rebalance: test: success
[[email protected] ~]# gluster volume rebalance test stop                # once every node shows completed, the task can be stopped
                                    Node Rebalanced-files          size       scanned      failures       skipped               status  run time in h:m:s
                               ---------      -----------   -----------   -----------   -----------   -----------         ------------     --------------
                               localhost                2        0Bytes            10             0             0            completed        0:00:00
                                      g2                1        0Bytes             9             0             0            completed        0:00:00
                                      g3                3        0Bytes             6             0             0            completed        0:00:00
volume rebalance: test: success: rebalance process may be in the middle of a file migration.
The process will be fully stopped once the migration of the file is complete.
Please check rebalance process for completion before doing any further brick related tasks on the volume.
[[email protected] ~]# gluster volume rebalance test status        # no rebalance task is running on this volume anymore
volume rebalance: test: failed: Rebalance not started for volume test.
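The two add-brick failures above can be caught before calling gluster: the brick count must be a multiple of the replica count, and gluster warns about bricks on the root partition. A minimal pre-flight sketch (the helper names `brick_count_ok` and `on_root_partition` are my own, not gluster commands; `stat -c %m` assumes GNU coreutils):

```shell
#!/bin/sh
# Pre-flight checks before `gluster volume add-brick` (sketch only;
# the helper names are illustrative, not part of the gluster CLI).

# Bricks must be added in multiples of the replica count, or gluster
# rejects the command with "Incorrect number of bricks supplied".
brick_count_ok() {
    replica=$1; shift
    [ $(( $# % replica )) -eq 0 ]
}

# gluster warns when a brick directory lives on the root partition;
# comparing the path's mount point with / detects that early.
on_root_partition() {
    [ "$(stat -c %m "$1")" = "/" ]
}

# Example: with replica 2, five bricks are rejected, four are fine.
brick_count_ok 2 b1 b2 b3 b4 b5 || echo "5 bricks with replica 2: wrong count"
brick_count_ok 2 b1 b2 b3 b4    && echo "4 bricks with replica 2: ok"
```

Running these checks first avoids starting an add-brick that gluster will refuse anyway.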
  • 2. Shrinking
[[email protected] ~]# gluster volume remove-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 start                # remove bricks in multiples of the replica count; data on the removed bricks starts migrating now
Running remove-brick with cluster.force-migration enabled can result in data corruption. It is safer to disable this option so that files that receive writes during migration are not migrated.
Files that are not migrated can then be manually copied after the remove-brick commit operation.
Do you want to continue with your current cluster.force-migration settings? (y/n) y
volume remove-brick start: success
ID: 9b4657c0-ed29-4c75-8bb6-7b8f277f02ec
[[email protected] ~]# gluster volume remove-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 status            # check the migration status
                                    Node Rebalanced-files          size       scanned      failures       skipped               status  run time in h:m:s
                               ---------      -----------   -----------   -----------   -----------   -----------         ------------     --------------
                               localhost                0        0Bytes            10             0             0            completed        0:00:00
                                      g2                0        0Bytes             5             0             0            completed        0:00:00
                                      g3                0        0Bytes             5             0             0            completed        0:00:00
[[email protected] ~]# gluster volume remove-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 commit            # once everything shows completed, remove the bricks from the volume
volume remove-brick commit: success
Check the removed bricks to ensure all files are migrated.
If files with data are found on the brick path, copy them via a gluster mount point before re-purposing the removed brick.
[[email protected] ~]# gluster volume remove-brick test g2:/glusterfs/data2 g3:/glusterfs/data2 g1:/glusterfs/data3 g2:/glusterfs/data3 status            # no task anymore
volume remove-brick status: failed: remove-brick not started for volume test.
[[email protected] ~]# gluster volume info test                 # back to the original 4 bricks

Volume Name: test
Type: Distributed-Replicate
Volume ID: 92ffe586-ea14-4b7b-9b89-5dfd626cb6d4
Status: Started
Snapshot Count: 0
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: g1:/glusterfs/data1
Brick2: g2:/glusterfs/data1
Brick3: g3:/glusterfs/data1
Brick4: g1:/glusterfs/data2
[[email protected] ~]# ls -a /glusterfs/data3        # delete the hidden directory left under every removed brick; otherwise it may cause problems when the brick is later added to another volume
.  ..  .glusterfs
[[email protected] ~]# rm -fr /glusterfs/data3/.glusterfs/
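Removing the hidden .glusterfs directory, as above, is only part of resetting a brick: gluster also stamps extended attributes (trusted.glusterfs.volume-id and trusted.gfid) on the brick root, and clearing them avoids an "already part of a volume" error when the directory is reused. A hedged cleanup sketch (`clean_brick` is my own helper; `setfattr` comes from the attr package):

```shell
#!/bin/sh
# Reset a brick directory after remove-brick commit so it can be
# reused in another volume. `clean_brick` is an illustrative helper.
clean_brick() {
    brick=$1
    # Internal gfid tree gluster leaves behind on every brick.
    rm -rf "$brick/.glusterfs"
    # Markers gluster stamps on the brick root; if they remain,
    # a later add-brick can fail with "... is already part of a volume".
    setfattr -x trusted.glusterfs.volume-id "$brick" 2>/dev/null || true
    setfattr -x trusted.gfid "$brick" 2>/dev/null || true
}

clean_brick /glusterfs/data3
```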
  • 3. Replacement
[[email protected] ~]# gluster volume replace-brick test g3:/glusterfs/data2 g3:/glusterfs/data3 commit force                    # replace g3:/glusterfs/data2 with g3:/glusterfs/data3
volume replace-brick: success: replace-brick commit force operation successful
[[email protected] ~]# gluster volume info test

Volume Name: test
Type: Distributed-Replicate
Volume ID: 92ffe586-ea14-4b7b-9b89-5dfd626cb6d4
Status: Started
Snapshot Count: 0
Number of Bricks: 4 x 2 = 8
Transport-type: tcp
Bricks:
Brick1: g1:/glusterfs/data1
Brick2: g2:/glusterfs/data1
Brick3: g3:/glusterfs/data1
Brick4: g1:/glusterfs/data2
Brick5: g2:/glusterfs/data2
Brick6: g3:/glusterfs/data3                    # this one was replaced
Brick7: g1:/glusterfs/data3
Brick8: g2:/glusterfs/data3
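After `replace-brick ... commit force`, the data on the new brick is rebuilt by self-heal in the background, so it is worth waiting until `gluster volume heal test info` reports zero pending entries before trusting the replacement. A generic polling sketch (the wait loop is my own; it only assumes the heal-info output contains one "Number of entries: N" line per brick):

```shell
#!/bin/sh
# Poll a heal-info style command until no brick reports pending
# entries. Pass the real command as arguments, e.g.
#   wait_for_heal gluster volume heal test info
wait_for_heal() {
    # Keep polling while any brick still has entries to heal.
    while "$@" | grep -Eq 'Number of entries: [1-9]'; do
        sleep 5
    done
}
```

In real use this would be called as `wait_for_heal gluster volume heal test info` once the replace-brick commit has returned.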
Other maintenance commands:
https://www.cnblogs.com/jicki/p/5801712.html

1. List all GlusterFS volumes:
[[email protected] ~]# gluster volume list

2. Delete a GlusterFS volume:
[[email protected] ~]# gluster volume stop models        # stop the volume named models
[[email protected] ~]# gluster volume delete models      # delete the volume named models
Note: after deleting a volume, you must delete the ( .glusterfs/ .trashcan/ ) directories inside the brick directory ( /opt/gluster/data ).
Otherwise, a new volume created on the same brick can end up with files that are not distributed, or with type confusion.

3. Detach a node from GlusterFS:
[[email protected] ~]# gluster peer detach swarm-node-2

4. Set access restrictions, per volume:
[[email protected] ~]# gluster volume set models auth.allow 10.6.0.*,10.7.0.*

5. Add a GlusterFS node:
[[email protected] ~]# gluster peer probe swarm-node-3
[[email protected] ~]# gluster volume add-brick models swarm-node-3:/opt/gluster/data
Note: for replicated or striped volumes, the number of bricks added each time must be a multiple of replica or stripe.

6. Configure volume options:
[[email protected] ~]# gluster volume set

7. Shrink a volume: first migrate the data to the other available bricks, and remove the brick only after migration finishes:
[[email protected] ~]# gluster volume remove-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data start
After start, use the status command to watch the removal progress:
[[email protected] ~]# gluster volume remove-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data status
To remove the brick directly without migrating the data:
[[email protected] ~]# gluster volume remove-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data commit
Note: for replicated or striped volumes, the number of bricks removed each time must be a multiple of replica or stripe.
Expand:
gluster volume add-brick models swarm-node-2:/opt/gluster/data

8. Repair command:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data commit force

9. Migrate a volume:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data start
pause suspends the migration:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data pause
abort cancels the migration:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data abort
status shows the migration state:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data status
when the migration is finished, commit makes it take effect:
[[email protected] ~]# gluster volume replace-brick models swarm-node-2:/opt/gluster/data swarm-node-3:/opt/gluster/data commit
10. Rebalance a volume:

[[email protected] ~]# gluster volume rebalance models fix-layout start
[[email protected] ~]# gluster volume rebalance models start
[[email protected] ~]# gluster volume rebalance models start force
[[email protected] ~]# gluster volume rebalance models status
[[email protected] ~]# gluster volume rebalance models stop