matrix-docker-ansible-deploy

Gráfico de commits

Autor	SHA1	Mensagem	Data
Slavi Pantaleev	f99dcd611f	Pass proper UID/GID to Synapse Fixes a regression caused by `a5ee39266c`. If the user id and group id were different than 991:991 (which used to be a hardcoded default for us long ago), there was a mismatch between what Synapse was trying to use (991:991) and what it was actually started with (in `--user=..`). It was then trying to change ownership, which was failing. This was mostly affecting newer installations which were not using the 991:991 defaults we had long ago (since `a1c5a197a9`).	5 anos atrás
Slavi Pantaleev	a5ee39266c	Go through start.py when launching Synapse This allows us to benefit from helpful things it does for us, like enabling jemalloc: https://github.com/matrix-org/synapse/pull/8553 We weren't going through `start.py` before, because it was causing some conflict with our `docker run --user=...` stuff, but it doesn't seem to be a problem anymore. Having done this, we won't need to do things like https://github.com/spantaleev/matrix-docker-ansible-deploy/pull/941 anymore.	5 anos atrás
Slavi Pantaleev	512f42aa76	Do not report docker kill/rm attempts as errors These are just defensive cleanup tasks that we run. In the good case, there's nothing to kill or remove, so they trigger an error like this: > Error response from daemon: Cannot kill container: something: No such container: something and: > Error: No such container: something People often ask us if this is a problem, so instead of always having to answer with "no, this is to be expected", we'd rather eliminate it now and make logs cleaner. In the event that: - a container is really stuck and needs cleanup using kill/rm - and cleanup fails, and we fail to report it because of error suppression (`2>/dev/null`) .. we'd still get an error when launching ("container name already in use .."), so it shouldn't be too hard to investigate.	5 anos atrás
Slavi Pantaleev	70796703d3	Run Synapse workers in their own containers This switches the `docker exec` method of spawning Synapse workers inside the `matrix-synapse` container with dedicated containers for each worker. We also have dedicated systemd services for each worker, so this are now: - more consistent with everything else (we don't use systemd instantiated services anywhere) - we don't need the "parse systemd instance name into worker name + port" part - we don't need to keep track of PIDs manually - we don't need jq (less depenendencies) - workers dying would be restarted by systemd correctly, like any other service - `docker ps` shows each worker separately and we can observe resource usage	5 anos atrás
Slavi Pantaleev	63301b0ef1	Improvements around Synapse worker/metrics ports exposure There was a `matrix_nginx_proxy_enabled\|default(False)` check, but: - it didn't seem to work reliably for some reason (hmm) - referring to a `matrix_nginx_proxy_*` variable from within the `matrix-synapse` role is not ideal - exposing always happened on `127.0.0.1`, which may not be good enough for some rarer setups (where the own webserver is external to the host)	5 anos atrás
Marcel Partap	edc21f15e5	Restrict publishing worker (metrics) ports to localhost	5 anos atrás
Slavi Pantaleev	1692a28fe4	Work around annoying Docker warning about undefined $HOME > WARNING: Error loading config file: .dockercfg: $HOME is not defined .. which appeared in Docker 20.10.	5 anos atrás
Slavi Pantaleev	d08b27784f	Fix systemd services autostart problem with Docker 20.10 The Docker 19.04 -> 20.10 upgrade contains the following change in `/usr/lib/systemd/system/docker.service`: ``` -BindsTo=containerd.service -After=network-online.target firewalld.service containerd.service +After=network-online.target firewalld.service containerd.service multi-user.target -Requires=docker.socket +Requires=docker.socket containerd.service Wants=network-online.target ``` The `multi-user.target` requirement in `After` seems to be in conflict with our `WantedBy=multi-user.target` and `After=docker.service` / `Requires=docker.service` definitions, causing the following error on startup for all of our systemd services: > Job matrix-synapse.service/start deleted to break ordering cycle starting with multi-user.target/start A workaround which appears to work is to add `DefaultDependencies=no` to all of our services.	5 anos atrás
Marcel Partap	f201bca519	synapse workers: define and expose METRICS port for each worker As seen on TV: https://github.com/matrix-org/synapse/blob/master/docs/metrics-howto.md#monitoring-workers	5 anos atrás
Slavi Pantaleev	1fca917ad1	Replace some -v instances with --mount `-v` magically creates the source destination as a directory, if it doesn't exist already. We'd like to avoid this magic and the potential breakage that it might cause. We'd rather fail while Docker tries to find things to `--mount` than have it automatically create directories and fail anyway, while having contaminated the filesystem. There's a lot more `-v` instances remaining to be fixed later on. This is just some start. Things like `matrix_synapse_container_additional_volumes` and `matrix_nginx_proxy_container_additional_volumes` were not changed to use `--mount`, as options for each one are passed differently (`ro` is `ro`, but `rw` doesn't exist and `slave` is `bind-propagation=slave`). To avoid breaking people's custom volume mounts, we keep it as it is for now. A deficiency with `--mount` is that it lacks the `z` option (SELinux ownership changes), and some of our `-v` instances use that. I'm not sure how supported SELinux is for us right now, but it might be, and breaking that would not be a good idea.	5 anos atrás
Marcel Partap	2d1b9f2dbf	synapse workers: reworkings + get endpoints from upstream docs via awk (yes, a bit awkward and brittle… xD)	5 anos atrás
Slavi Pantaleev	8bae39050e	Update settings for Synapse v1.14.0	5 anos atrás
Chris van Dijk	6334f6c1ea	Remove hardcoded command paths in systemd unit files Depending on the distro, common commands like sleep and chown may either be located in /bin or /usr/bin. Systemd added path lookup to ExecStart in v239, allowing only the command name to be put in unit files and not the full path as historically required. At least Ubuntu 18.04 LTS is however still on v237 so we should maintain portability for a while longer.	5 anos atrás
Slavi Pantaleev	8fb3ce6f6d	Upgrade Synapse (v1.12.4 -> v1.13.0)	5 anos atrás
Marcel Partap	66a4073512	Publish synapse worker ports, need to be accessible to nginx	5 anos atrás
Aaron Raimist	79d1576648	Allow Synapse manhole to be enabled Can you double check that the way I have this set only exposes it locally? It is important that the manhole is not available to the outside world since it is quite powerful and the password is hard coded.	6 anos atrás
recklesscoder	5d3b765241	Actually use matrix_synapse_storage_path matrix_synapse_storage_path is already defined in matrix-synapse/defaults/main.yml (with a default of "{{ matrix_synapse_base_path }}/storage"), but was not being used for its presumed purpose in matrix-synapse.service.j2. As a result, if matrix_synapse_storage_path was overridden (in a vars.yml), the synapse service failed to start.	6 anos atrás
Slavi Pantaleev	d6d6c152a3	Delay bridge startup to ensure Synapse is up Bridges start matrix-synapse.service as a dependency, but Synapse is sometimes slow to start, while bridges are quick to hit it and die (if unavailable). They'll auto-restart later, but .. this still breaks `--tags=start`, which doesn't wait long enough for such a restart to happen. This attempts to slow down bridge startup enough to ensure Synapse is up and no failures happen at all.	6 anos atrás
Slavi Pantaleev	ab59cc50bd	Add support for more flexible container port exposing Fixes #171 (Github Issue).	6 anos atrás
Slavi Pantaleev	ae7c8d1524	Use SyslogIdentifier to improve logging Reasoning is the same as for matrix-org/synapse#5023. For us, the journal used to contain `docker` for all services, which is not very helpful when looking at them all together (`journalctl -f`).	6 anos atrás
Hugues De Keyzer	c451025134	Fix indentation in templates Use Jinja2 lstrip_blocks option in templates to ensure consistent indentation in generated files.	6 anos atrás
Sylvia van Os	75b1528d13	Add the possibility to pass extra flags to the docker container	6 anos atrás
Slavi Pantaleev	892abdc700	Do not refer to Synapse as "Matrix Synapse"	6 anos atrás
Slavi Pantaleev	91a757c581	Add support for reloading Synapse	7 anos atrás
Slavi Pantaleev	f6ebd4ce62	Initial work on Synapse 0.99/1.0 preparation	7 anos atrás
dhose	87e3deebfd	Enable exposure of Prometheus metrics.	7 anos atrás
Slavi Pantaleev	0be7b25c64	Make (most) containers run with a read-only filesystem	7 anos atrás
Slavi Pantaleev	316d653d3e	Drop capabilities in containers We run containers as a non-root user (no effective capabilities). Still, if a setuid binary is available in a container image, it could potentially be used to give the user the default capabilities that the container was started with. For Docker, the default set currently is: - "CAP_CHOWN" - "CAP_DAC_OVERRIDE" - "CAP_FSETID" - "CAP_FOWNER" - "CAP_MKNOD" - "CAP_NET_RAW" - "CAP_SETGID" - "CAP_SETUID" - "CAP_SETFCAP" - "CAP_SETPCAP" - "CAP_NET_BIND_SERVICE" - "CAP_SYS_CHROOT" - "CAP_KILL" - "CAP_AUDIT_WRITE" We'd rather prevent such a potential escalation by dropping ALL capabilities. The problem is nicely explained here: https://github.com/projectatomic/atomic-site/issues/203	7 anos atrás
Slavi Pantaleev	299a8c4c7c	Make (most) containers start as non-root This makes all containers (except mautrix-telegram and mautrix-whatsapp), start as a non-root user. We do this, because we don't trust some of the images. In any case, we'd rather not trust ALL images and avoid giving `root` access at all. We can't be sure they would drop privileges or what they might do before they do it. Because Postfix doesn't support running as non-root, it had to be replaced by an Exim mail server. The matrix-nginx-proxy nginx container image is patched up (by replacing its main configuration) so that it can work as non-root. It seems like there's no other good image that we can use and that is up-to-date (https://hub.docker.com/r/nginxinc/nginx-unprivileged is outdated). Likewise for riot-web (https://hub.docker.com/r/bubuntux/riot-web/), we patch it up ourselves when starting (replacing the main nginx configuration). Ideally, it would be fixed upstream so we can simplify.	7 anos atrás
Slavi Pantaleev	56d501679d	Be explicit about the UID/GID we start Synapse with We do match the defaults anyway (by default that is), but people can customize `matrix_user_uid` and `matrix_user_uid` and it wouldn't be correct then. In any case, it's better to be explicit about such an important thing.	7 anos atrás
Slavi Pantaleev	c10182e5a6	Make roles more independent of one another With this change, the following roles are now only dependent on the minimal `matrix-base` role: - `matrix-corporal` - `matrix-coturn` - `matrix-mailer` - `matrix-mxisd` - `matrix-postgres` - `matrix-riot-web` - `matrix-synapse` The `matrix-nginx-proxy` role still does too much and remains dependent on the others. Wiring up the various (now-independent) roles happens via a glue variables file (`group_vars/matrix-servers`). It's triggered for all hosts in the `matrix-servers` group. According to Ansible's rules of priority, we have the following chain of inclusion/overriding now: - role defaults (mostly empty or good for independent usage) - playbook glue variables (`group_vars/matrix-servers`) - inventory host variables (`inventory/host_vars/matrix.<your-domain>`) All roles default to enabling their main component (e.g. `matrix_mxisd_enabled: true`, `matrix_riot_web_enabled: true`). Reasoning: if a role is included in a playbook (especially separately, in another playbook), it should "work" by default. Our playbook disables some of those if they are not generally useful (e.g. `matrix_corporal_enabled: false`).	7 anos atrás
Slavi Pantaleev	51312b8250	Split playbook into multiple roles As suggested in #63 (Github issue), splitting the playbook's logic into multiple roles will be beneficial for maintainability. This patch realizes this split. Still, some components affect others, so the roles are not really independent of one another. For example: - disabling mxisd (`matrix_mxisd_enabled: false`), causes Synapse and riot-web to reconfigure themselves with other (public) Identity servers. - enabling matrix-corporal (`matrix_corporal_enabled: true`) affects how reverse-proxying (by `matrix-nginx-proxy`) is done, in order to put matrix-corporal's gateway server in front of Synapse We may be able to move away from such dependencies in the future, at the expense of a more complicated manual configuration, but it's probably not worth sacrificing the convenience we have now. As part of this work, the way we do "start components" has been redone now to use a loop, as suggested in #65 (Github issue). This should make restarting faster and more reliable.	7 anos atrás
Slavi Pantaleev	bfcceb1e82	Make it safer to override matrix_synapse_media_store_path This is described in Github issue #58. Until now, we had the variable, but if you redefined it, you'd run into multiple problems: - we actually always mounted some "storage" directory to the Synapse container. So if your media store is not there, you're out of luck - homeserver.yaml always hardcoded the path to the media store, as a directory called "media-store" inside the storage directory. Relocating to outside the storage directory was out of the question. Moreover, even if you had simply renamed the media store directory (e.g. "media-store" -> "media_store"), it would have also caused trouble. With this patch, we mount the media store's parent to the Synapse container. This way, we don't care where the media store is (inside storage or not). We also don't assume (anymore) that the final part of the path is called "media-store" -- anything can be used. The "storage" directory and variable (`matrix_synapse_storage_path`) still remain for compatibility purposes. People who were previously overriding `matrix_synapse_storage_path` can continue doing so and their media store will be at the same place. The playbook no longer explicitly creates the `matrix_synapse_storage_path` directory though. It's not necessary. If the media store is specified to be within it, it will get created when the media store directory is created by the playbook.	7 anos atrás
Slavi Pantaleev	fb5115a544	Rename playbook variables so they are consistently prefixed Pretty much all variables live in their own `matrix_<whatever>` prefix now and are grouped closer together in the default variables file (`roles/matrix-server/defaults/main.yml`).	7 anos atrás
Slavi Pantaleev	67a445a74a	Add support for controlling Matrix federation	7 anos atrás
Slavi Pantaleev	242f388af3	Make Synapse cache factor configurable	7 anos atrás
Slavi Pantaleev	161854e6d7	Disable Docker container logging `--log-driver=none` is used for all Docker containers now. All these containers are started through systemd anyway and get logged in journald, so there's no need for Docker to be logging the same thing using the default `json-file` driver. Doing that was growing `/var/lib/docker/containers/..` infinitely until service/container restart. As a result of this, things like `docker logs matrix-synapse` won't work anymore. `journalctl -u matrix-synapse` is how one can see the logs.	7 anos atrás
Slavi Pantaleev	ea43d46b70	Add matrix-synapse-rest-auth support	7 anos atrás
Slavi Pantaleev	21da2f572b	Add email-sending support	7 anos atrás
Slavi Pantaleev	084a0a0e53	Minor consistency improvement	7 anos atrás
Slavi Pantaleev	700602eed3	Rename a bunch of playbook variables for better consistency	7 anos atrás
Slavi Pantaleev	3fd6fd647f	Put all containers in their own isolated Docker network (matrix) Moving away from using the default bridge network to using our own. This isolates our services from other Docker containers running on the default network on the same host. The benefits are that: - isolation is a little better - we no longer share a default bridge network with any other containers that might be running on the host - there are no longer hard dependencies - we do service discovery by DNS name, and not via explicit `--link` usage during container start, so containers can start out of order and fail without bringing down others with them (`matrix-nginx-proxy` can continue running, even if one of the other services dies) In the future, when other services get introduced, the increased resilience and simplicity will help as well.	7 anos atrás
Slavi Pantaleev	5399e2b6bb	Do not require (but want) matrix-coturn.service in matrix-synapse It's not really a requirement, as it can function without it. Also, restarting matrix-coturn doesn't need to restart matrix-synapse.	7 anos atrás
Slavi Pantaleev	b3e62126db	Switch Docker image to official one Switching from from avhost/docker-matrix (silviof/docker-matrix) to matrixdotorg/synapse. The avhost/docker-matrix (silviof/docker-matrix) image used to bundle in the coturn STUN/TURN server, so as part of the move, we're separating this to a separately-ran service (matrix-coturn.service, powered by instrumentisto/coturn-docker-image)	7 anos atrás
Slavi Pantaleev	efc78fb9d3	Switch from s3fs to Goofys Improves performance of media store operations.	8 anos atrás
Slavi Pantaleev	3a5f82267b	Do not use Let's Encrypt certificate for Synapse's federation port As described here ( https://github.com/matrix-org/synapse/issues/2438#issuecomment-327424711 ), using own SSL certificates for the federation port is more fragile, as renewing them could cause federation outages. The recommended setup is to use the self-signed certificates generated by Synapse. On the 443 port (matrix-nginx-proxy) side, we still use the Let's Encrypt certificates, which ensures API consumers work without having to trust "our own CA". Having done this, we also don't need to ever restart Synapse anymore, as no new SSL certificates need to be applied there. It's just matrix-nginx-proxy that needs to be restarted, and it doesn't even need a full restart as an "nginx reload" does the job of swithing to the new SSL certificates.	8 anos atrás
Slavi Pantaleev	6962bfcc42	Add support for not taking over a server (no matrix-nginx-proxy) and disabling Riot	8 anos atrás
Slavi Pantaleev	cb323f5b4c	Move SSL certificates from /etc/pki/acmetool-certs to /matrix/ssl Moving keeps everything in the /matrix directory, so that we wouldn't contaminate anything else on the system or risk clashing with something else. Also retrieving certificates separately for the Riot and Matrix domains, which should help in multiple ways: - allows them to be very different (completely separate base domain..) - allows for Riot to be disabled for the playbook some time later and still have the code not break	8 anos atrás
Slavi Pantaleev	ded7c274f6	Add support for Debian (9+) and Ubuntu (16.04+)	8 anos atrás
Slavi Pantaleev	ab1a9fd87e	Add support for using an external PostgreSQL server	8 anos atrás

38 Commits (ea6e344d055eebf3e837da9caad5fae5ff2e1fe9)