If a few servers are linked up and talk to each other using TCP/IP (?) but aren’t connected to the wider network, that’s not enough for it to be considered another internet (but it could be an intranet).
If a few instances are linked up and talk to each other using ActivityPub but aren’t connected to the wider network, I think that’s not enough for it to be considered another fediverse.
I can get behind the general idea, but in this implementation specifically it seems like the low modulation example isn’t distinct enough from simply lower-quality audio, but the higher modulation example (where the effect is more distinct as an intentional effect), is just not nice to listen to. Maybe there are other ways to distort the voice that don’t have as much of that downside?