Currently, only connlib's UDP sockets for sending and receiving STUN &
WireGuard traffic are protected from routing loops. This was done via
the `Sockets::with_protect` function. Connlib has additional sockets
though:
- A TCP socket to the portal.
- UDP & TCP sockets for DNS resolution via hickory.
Both of these can incur routing loops on certain platforms, which becomes
evident as we try to implement #2667.
To fix this, we generalise the idea of "protecting" a socket via a
`SocketFactory` abstraction. By allowing the different platforms to
provide a specialised `SocketFactory`, anything Linux-based can give
special treatment to the socket before handing it to connlib.
As an additional benefit, this allows us to remove the `Sockets`
abstraction from connlib's API again because we can now initialise it
internally via the provided `SocketFactory` for UDP sockets.
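To illustrate the idea, here is a minimal sketch of what such a factory could look like (the trait shape and names are assumptions, not connlib's actual API):

```rust
use std::io;
use std::net::{SocketAddr, UdpSocket};

// Sketch only: each platform supplies its own factory and can give the
// socket special treatment (e.g. protect it from routing loops) before
// connlib ever uses it.
trait SocketFactory<S> {
    fn bind(&self, addr: SocketAddr) -> io::Result<S>;
}

struct ProtectedUdpFactory;

impl SocketFactory<UdpSocket> for ProtectedUdpFactory {
    fn bind(&self, addr: SocketAddr) -> io::Result<UdpSocket> {
        let socket = UdpSocket::bind(addr)?;
        // Platform hook: e.g. Android would call `VpnService.protect`
        // on the raw fd here so the socket's traffic bypasses the tunnel.
        Ok(socket)
    }
}
```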
---------
Signed-off-by: Gabi <gabrielalejandro7@gmail.com>
Co-authored-by: Thomas Eizinger <thomas@eizinger.io>
When the property-based state machine test was first created, I
envisioned that we could also easily test advancing time. Unfortunately,
the tricky part of advancing time is to correctly encode the _expected_
behaviour as it requires knowledge of all timeouts etc.
Thus, the `Tick` transition has been left lingering and doesn't actually
test much. It is obviously still sampled by the test runner and thus
"wastes" test cases that don't end up exercising anything meaningful
because the time advancements are < 1000ms.
There are plans to test time-related behaviour, if only roughly, by
implementing delays between applying `Transmit`s. Until then, we can remove the
`Tick` transition.
Connlib's routing logic and networking code is entirely platform
agnostic. The only platform-specific bit is how we interact with the TUN
device. From connlib's perspective though, all it needs is an interface
for reading and writing. How the device gets initialised and updated is
client-business.
For the most part, this is the same on all platforms: We call callbacks
and the client updates the state accordingly. The only annoying bit here
is that Android recreates the TUN interface on every update and thus our
old file descriptor is invalid. The current design works around this by
returning the new file descriptor on Android. This is a problematic
design for several reasons:
- It forces the callback handler to finish synchronously, halting
connlib until it completes.
- The synchronous nature also means we cannot replace the callbacks with
events as events don't have a return value.
To fix this, we introduce a new `set_tun` method on `Tunnel`. This moves
the business of how the `Tun` device is created up to the client. The
clients are already platform-specific so this makes sense. In a future
iteration, we can move all the various `Tun` implementations all the way
up to the client-specific crates, thus co-locating the platform-specific
code.
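A rough sketch of the new surface (the `Tun` trait and field layout here are illustrative; the real types live in `firezone-tunnel` and may differ):

```rust
use std::io;

/// Sketch of the device interface connlib needs: just reading and writing.
trait Tun: Send + 'static {
    fn read(&mut self, buf: &mut [u8]) -> io::Result<usize>;
    fn write(&mut self, packet: &[u8]) -> io::Result<usize>;
}

struct Tunnel {
    tun: Option<Box<dyn Tun>>,
}

impl Tunnel {
    /// The client constructs the platform-specific `Tun` itself and
    /// hands it in; on Android, this is simply called again whenever
    /// the platform recreates the interface with a new file descriptor.
    fn set_tun(&mut self, tun: Box<dyn Tun>) {
        self.tun = Some(tun);
    }
}
```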
Initialising `Tun` from the outside surfaces another issue: The routes
are still set via the `Tun` handle on Windows. To fix this, we introduce
a `make_tun` function on `TunDeviceManager` so that it can remember the
interface index on Windows, allowing us to move the setting of routes
into `TunDeviceManager`.
This simplifies several of connlib's APIs, which are now infallible.
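Roughly, the shape could be as follows (all names except `make_tun` and `TunDeviceManager` are illustrative):

```rust
use std::io;

struct Tun; // placeholder for the platform-specific device handle

struct TunDeviceManager {
    /// Remembered when the device is created, so that routes can later
    /// be set without going through the `Tun` handle (Windows).
    interface_index: Option<u32>,
}

impl TunDeviceManager {
    fn make_tun(&mut self) -> io::Result<Tun> {
        let tun = Tun; // platform-specific device creation elided
        self.interface_index = Some(0); // illustrative index
        Ok(tun)
    }

    fn set_routes(&mut self) -> io::Result<()> {
        // Uses `self.interface_index` instead of the `Tun` handle.
        Ok(())
    }
}
```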
Resolves: #4473.
---------
Co-authored-by: Reactor Scram <ReactorScram@users.noreply.github.com>
Co-authored-by: conectado <gabrielalejandro7@gmail.com>
I started a playbook for publishing GUI releases; I didn't see any
other one around.
I think there's a middle step I'm not clear on:
1. Open this PR and get it approved
2. Do something? Publish the draft release maybe? Run a special CI
workflow?
3. Merge this PR to update the changelog and bump the versions in Git
Currently, `tunnel_test` has some old code that attempted to handle
resource _updates_ as part of adding new ones. That is outdated and
wrong. The test is easier to reason about if we disallow updates to
resources as part of _adding_ a new one.
In production, resource IDs are unique, so this shouldn't actually
happen. At a later point, we can add explicit transitions for updating
an existing resource.
Currently, `tunnel_test` exercises a lot of code paths within connlib
already by adding & removing resources, roaming the client and sending
ICMP packets. Yet, it does all of this with just a single gateway
whereas in production, we are very likely using more than one gateway.
To capture these other code paths, we now sample between 1 and 3
gateways and randomly assign each added resource to one of them, which
makes us hit the code paths that select between different gateways (see
the sketch below).
Most importantly, the reference implementation has barely any knowledge
about those individual connections. Instead, it is implemented in
terms of connectivity to resources.
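For illustration, the sampling could be expressed with a proptest strategy like this (names are made up, not the test's actual code):

```rust
use proptest::prelude::*;

// Pick 1..=3 gateways, then assign a resource to one of them by index.
// Shrinking drives both values towards their minimum.
fn gateway_assignment() -> impl Strategy<Value = (usize, usize)> {
    (1usize..=3).prop_flat_map(|num_gateways| (Just(num_gateways), 0..num_gateways))
}
```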
The `TunDeviceManager` is a component that the leaf nodes of our
dependency tree need: the binaries. Thus, it is misplaced in the
`connlib-shared` crate, which sits at the very bottom of the dependency
tree.
This is necessary to allow the `TunDeviceManager` to actually construct
a `Tun` (which currently lives in `firezone-tunnel`).
Related: #5839.
---------
Signed-off-by: Thomas Eizinger <thomas@eizinger.io>
Co-authored-by: Reactor Scram <ReactorScram@users.noreply.github.com>
Currently, the logging of fields in spans for encapsulate and
decapsulate operations is a bit inconsistent between client and gateway.
Logging the `from` field for every message is actually quite redundant
because most of these logs are emitted within `snownet`'s `Allocation`,
which can add its own span to indicate which relay we are talking to.
For most other operations, it is much more useful to log the connection
ID instead of IPs.
This should make the logs a bit more succinct.
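For illustration, attaching the relay once at the span level looks roughly like this (function and field names are made up):

```rust
use std::net::SocketAddr;
use tracing::info_span;

fn handle_relay_message(relay: SocketAddr) {
    // One span field replaces a `from` field on every log line.
    let span = info_span!("allocation", %relay);
    let _entered = span.enter();
    tracing::debug!("received channel data"); // inherits `relay` from the span
}
```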
Closes #5810
```[tasklist]
### Tasks
- [x] Try not to set the icon every time we change Resources
- [x] Get production icons
- [x] Add changelog comment
- [x] Add CI stress test that sets the icon 10,000 times
- [x] Open for review
- [x] Repair changelog
- [ ] Merge
```
---------
Signed-off-by: Reactor Scram <ReactorScram@users.noreply.github.com>
We have several representations of `ResourceDescription` within connlib.
The ones within the `callbacks` module are meant for _presentation_ to
the clients and thus contain additional information like the site
status.
The `From` impls deleted within the PR are only used within tests. We
can rewrite those tests by asserting on the presented data instead.
This is better because it means information about resources only flows
in one direction: From connlib to the clients.
Applying the initial `init` closure may also print logs that are
currently not captured within the corresponding span. By using
`in_scope`, we ensure those logs are also correctly captured in the
corresponding span.
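A minimal example of the pattern (the `init` body is illustrative):

```rust
use tracing::info_span;

fn init() -> &'static str {
    tracing::info!("initialising"); // now attributed to the span below
    "state"
}

fn main() {
    let span = info_span!("session");
    // `in_scope` enters the span for the duration of the closure, so
    // logs emitted inside `init` carry the span's context.
    let _state = span.in_scope(init);
}
```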
This also improves some function names (e.g. don't say `windows_` when
we're already in `windows.rs`) and adds comments justifying why some
functions with only one call site are split out.
I started this intending to use it to practice the sans-I/O style. It
didn't come up, but I did get rid of that `spawn`.
We are referencing the `tokio` dependency a lot and it makes sense to
ensure that version is tracked only once across the whole workspace.
Extracted out of #5797.
---------
Co-authored-by: Not Applicable <ReactorScram@users.noreply.github.com>
Currently, the type hierarchy within `tunnel_test` is already quite
nested: We have a `Host` that wraps a `SimNode` which wraps a
`ClientState` or `GatewayState`. Additionally, a lot of state that is
actually _per_ client or _per_ gateway is tracked in the root of
`ReferenceState` and `TunnelTest`. That makes it difficult to introduce
multiple gateways / clients to this test.
To fix this, we introduce dedicated `RefClient` and `RefGateway` states.
Those track the expected state of a particular client / gateway.
Similarly, we introduce dedicated `SimClient` and `SimGateway` structs
that track the simulation state by wrapping the corresponding
system-under-test: `ClientState` or `GatewayState`.
This ends up moving a lot of code around but has the great benefit that
all the state is now scoped to a particular instance of a client or a
gateway, paving the way for creating multiple clients & gateways in a
single test.
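Schematically, the split looks like this (placeholder types stand in for connlib's real ones; the fields are illustrative):

```rust
struct ClientState; // system-under-test, provided by connlib
struct GatewayState;

/// Simulation state for one client, wrapping the system-under-test.
struct SimClient {
    sut: ClientState,
}

/// The *expected* state of one client, tracked by the reference model.
struct RefClient;

/// The same split on the gateway side.
struct SimGateway {
    sut: GatewayState,
}
struct RefGateway;
```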
Closes #5760, refs #5790
Also removes some redundant IPC-related code that was nearby.
If you stop the IPC service, e.g. due to an update on Linux, it will say
"IPC connection closed". This isn't ideal but at least the Client does
catch it now, instead of failing on the next IPC send.
---------
Signed-off-by: Reactor Scram <ReactorScram@users.noreply.github.com>
The Arc+Notify thing was always overkill; I just thought it was useful
early on. With the IPC change, it's easier to just use the existing MPSC
channel.
Also removing `TunnelReady` and assuming that the tunnel is ready
whenever connlib sends us the first Resource list.
---------
Signed-off-by: Reactor Scram <ReactorScram@users.noreply.github.com>
Closes #5601
It looks like we can hit 100+ Mbps in theory. This covers Wintun, Tokio,
and Windows OS overhead. It doesn't cover the cryptography or anything
in connlib itself.
The code is kinda messy but I'm not sure how to clean it up so I'll just
leave it for review.
This test should fail if there are any regressions in #5598.
It fails if any packet is dropped or if the speed is under 100 Mbps.
```[tasklist]
### Tasks
- [x] Use `ip_packet::make`
- [x] Switch to `cargo bench`
- [x] Extract windows ARM PR
- [x] Clean up wintun.dll install code
- [x] Re-request review
```
Left over from #5789
This removes SIGHUP for the IPC service, which doesn't handle it anyway,
so it removes a code path that would just panic.
```[tasklist]
### Tasks
- [ ] Can we test this at all?
```
With the introduction of a routing table in #5786, we can very easily
introduce an additional relay to `tunnel_test`. In production, we are
always given two relays, so this mimics the production setup more
closely.
With the performance improvements of `tunnel_test` in #5786, the
`resource_management` test is now in the hot path of CI runtime.
Reducing the cycles to 50 should cut overall CI time by ~1 minute, as
the Windows builds are among the slowest.
Signed-off-by: Reactor Scram <ReactorScram@users.noreply.github.com>
Co-authored-by: Reactor Scram <ReactorScram@users.noreply.github.com>
This is a bit of a hack because features should never change behaviour.
Unfortunately, we can't use `cfg(test)` here because the proptests live
in a different crate; from that crate's perspective, this crate is
always compiled with `cfg(not(test))`.
Our `proptest` feature is really only meant to be activated during
testing so I think this is fine for now.
The benefit is that the test logs are much more terse because proptest
will shrink the IDs to `0`, `1` etc. With the upcoming addition of
multiple gateways and multiple relays, we will have a lot more IDs in
the logs. Thus, it is important that they stay legible.
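One way this can look, sketched on a hypothetical ID newtype (the real impls in connlib may differ):

```rust
#[derive(Debug, Clone, Copy)]
pub struct GatewayId(u64);

// Gated behind the `proptest` cargo feature instead of `cfg(test)`:
// the proptests live in a different crate, and from there this crate
// is always compiled with `cfg(not(test))`.
#[cfg(feature = "proptest")]
impl proptest::arbitrary::Arbitrary for GatewayId {
    type Parameters = ();
    type Strategy = proptest::strategy::BoxedStrategy<Self>;

    fn arbitrary_with(_: ()) -> Self::Strategy {
        use proptest::prelude::*;
        // Small integers shrink towards 0, keeping IDs in logs legible.
        any::<u64>().prop_map(GatewayId).boxed()
    }
}
```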
By convention, `tests` modules are gated behind `#[cfg(test)]` so they
don't end up in production code. Additionally, a `use super::*;` import
line ensures we have access to the parent module, which is usually the
one you want to test.
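I.e. the conventional shape:

```rust
#[cfg(test)]
mod tests {
    use super::*; // bring the module under test into scope

    #[test]
    fn does_the_thing() {
        // ...
    }
}
```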
Currently, `tunnel_test` uses a rather naive approach when dispatching
`Transmit`s. In particular, it separately asks the client, gateway and
relay whether they "want" a certain packet. In a real network,
these packets are routed based on their IP.
To mimic something similar, we introduce a `Host` abstraction that wraps
each component: client, gateway and relay. Additionally, we introduce a
`RoutingTable` where we can add and remove hosts. With these things in
place, routing a `Transmit` is as easy as looking up the destination IP
in the routing table and dispatching to the corresponding host.
Our hosts are type-safe: client, gateway and relay have different types.
Thus, we abstract over them using a `HostId` in order to know which
host a certain message is for. Following these patches, we can easily
introduce multiple gateways and relays to this test by simply making
more entries in this routing table. This will increase the test coverage
of connlib.
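A sketch of the shape (names follow this description; the details are illustrative):

```rust
use std::collections::HashMap;
use std::net::IpAddr;

#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
enum HostId {
    Client(u64),
    Gateway(u64),
    Relay(u64),
}

#[derive(Default)]
struct RoutingTable {
    routes: HashMap<IpAddr, HostId>,
}

impl RoutingTable {
    fn add_host(&mut self, ip: IpAddr, host: HostId) {
        self.routes.insert(ip, host);
    }

    /// Dispatching a `Transmit` becomes a single lookup on its dst IP.
    fn route(&self, dst: IpAddr) -> Option<HostId> {
        self.routes.get(&dst).copied()
    }
}
```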
Lastly, this patch massively increases the performance of `tunnel_test`.
It turns out that previously, we spent a lot of CPU cycles accessing
"random" IPs from very large iterators. With this patch, we take a
limited range of 100 IPs that we sample from, thus drastically
increasing performance of this test. The configured 1000 testcases
execute in 3s on my machine now (with opt-level 1, which is what we use
in CI).
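For illustration, sampling from such a limited pool could look like this (the base network is an assumption):

```rust
use proptest::prelude::*;
use std::net::Ipv4Addr;

fn host_ip() -> impl Strategy<Value = Ipv4Addr> {
    // 100 hosts in 10.0.0.0/24 instead of a huge iterator of addresses.
    (0u32..100).prop_map(|host| Ipv4Addr::from(u32::from(Ipv4Addr::new(10, 0, 0, 0)) + host))
}
```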
---------
Signed-off-by: Thomas Eizinger <thomas@eizinger.io>
Closes #5789
The SIGTERM catching would have helped debug #5790
```[tasklist]
### Tasks
- [x] catch SIGTERM and log when systemd shuts us down gracefully
- [x] Log architecture at startup
```
Prettier has three options for prose-wrap:
- `always`: Format prose (markdown) to the line-length (current)
- `never`: Use a single line for all prose (proposed)
- `preserve`: Don't lint prose
Settled on `preserve` due to discussion.
Fixes #5686
---------
Signed-off-by: Jamil <jamilbk@users.noreply.github.com>
Co-authored-by: Reactor Scram <ReactorScram@users.noreply.github.com>
I had to change the smoke test because it had a couple of issues:
- The IPC socket had the wrong permissions because I didn't realize you
can tell `su` / `sudo` / `runuser` to set a group in addition to setting
a user
- It had a hard-coded timer of 12 seconds, and one time the test failed
because the IPC service exited before the GUI finished loading. So I
changed it so the IPC service in smoke test mode will wait forever for
exactly one client, then quit.
```[tasklist]
### Tasks
- [x] Run `chown` in the Ubuntu smoke test
```
Closes #5449
The smoke tests expect `last_crash.dmp` at a fixed path, so in this case
we write the file with a timestamped name, then copy it over
`last_crash.dmp`.
Within `snownet`'s test harness, packets are dispatched in a particular
order and, if none of them match, they are assumed to be for the node
directly. We add a debug assert to ensure that the given address is in
fact part of the "local" interfaces that we have configured in the
tests.
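In spirit, the assertion is (signature and names are illustrative):

```rust
use std::net::SocketAddr;

// Before assuming a packet is for the node itself, check that its
// destination matches one of the interfaces the test configured.
fn assert_is_local(dst: SocketAddr, local_interfaces: &[SocketAddr]) {
    debug_assert!(
        local_interfaces.contains(&dst),
        "packet for {dst} does not match any locally configured interface"
    );
}
```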
PR #5700 had a typo in it. I didn't notice that these match arms use
`|`, so I accidentally flushed the DNS for an event that doesn't need it.
Only `OnUpdateResources` should flush DNS.
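A reconstruction of the bug's shape (`Event` and the second variant name are hypothetical):

```rust
enum Event {
    OnUpdateResources,
    OnSetInterfaceConfig, // hypothetical second event
}

fn flush_dns() {}

fn handle(event: Event) {
    match event {
        // Buggy version flushed for both variants via `|`:
        //   Event::OnUpdateResources | Event::OnSetInterfaceConfig => flush_dns(),
        Event::OnUpdateResources => flush_dns(),
        Event::OnSetInterfaceConfig => {}
    }
}
```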
```[tasklist]
### Tasks
- [x] Check the GUI saves its settings file
- [x] Check the IPC service writes the device ID to disk
- [x] Check the GUI writes a log file (skipped - we already check if the exported zip has any files in it)
- [x] Run the crash file through `minidump-stackwalk`
- [x] Reach feature parity with the original smoke tests
- [x] Ready for review
- [x] Finish #5452
- [ ] Start on #5453
```
I don't believe we use or need TCP for the Relays, so it's better to
keep those ports closed.
Also, the docker-compose.yml is updated to allow the `relay-1` service
to respond on all its ports, since we typically don't need those mapped.