Track Internet disconnections, provider outages with historical data, and automated speed testing.
For Windows, Linux, ARM64, ARMa7. Learn more by visiting www.outagesio.com
Trying to install on Linux
-
Hello,
I was trying to install the agent on Linux.
I created the file with the credentials in the etc directory.
I created the otm directory in my /home and copied the script in the same directory and made it executable.
When executing it just loops trying to download and failing to create some files:
--------------------
dadak@dadak-Inspiron-15-7569:~/otm$ ./starter_linux_ocp.sh
Make sure to create /etc/agentid with credentials before starting this script
Make sure your starter file points to the correct directory where you are placed the binary into - Edit this starter file accordingly
You could add this to /etc/rc.local to start it or use some other method.
Receiving binary location
Getting the OTM Package
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0Warning: Failed to create the file /otm/otminfo.log: No such file or directory
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
curl: (23) Failed writing body (0 != 17)
cat: /otm/otminfo.log: No such file or directory
cat: /otm/otminfo.log: No such file or directory
^Z
[1]+ Stopped ./starter_linux_ocp.sh
--------------------
Any idea what I am doing wrong?
The installation instructions talk about: Create a directory called /otm and copy the binary into this directory.
Which binary? The script? There is no button to download any binary (like in the Windows installation video). I understand that it should be downloaded by the script.
-
Hi and thanks for trying OutagesIO.
I downloaded the script and went through the process myself. You are right, there was a reference to binary and that has been updated now.
That could have led to a little confusion. You'll notice the difference in the output I share below.
I think all you have is a small path problem. You mentioned putting the starter script in your /home directory somewhere.
If you look at your output, you'll see this: cat: /otm/otminfo.log: No such file or directory
I think you just need to edit your starter_linux_otm.sh script and make sure everything points to whatever directory you have the file in.
If you created /home/otm for example, make sure all references to /otm are in fact /home/otm. Then I think you'll be fine.
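A quick way to find the lines that need changing is to search the starter script for the path. This is only a minimal sketch: `list_path_refs` is a made-up helper name, and the script name and the /otm path are simply the ones mentioned in this thread.

```shell
#!/bin/sh
# Hypothetical helper: print each line of a file that mentions a path,
# prefixed with its line number, so it can be edited by hand.
list_path_refs() {
    # $1 = path to look for (e.g. /otm), $2 = file to search
    grep -n -F -- "$1" "$2"
}

# Example (run from the directory that holds the starter script):
# list_path_refs /otm starter_linux_otm.sh
```

Note that lines already changed to /home/otm will still show up, since they also contain /otm.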
Here is what I did.
I created a new software agent from the Agents list.
I created the /etc/agentid file and put the shown credentials in it.
vi /etc/agentid
xxxxxxxx
xxxxxxxxxxxxxxxxxxxx
I saved this.
I created a directory called /otm and cd'd into it.
I copied the script code shown in the installation notes provided when I created the agent.
I then created the starter file
vi starter_linux_otm.sh
I pasted the contents of the code found in the installation notes.
I then started the file and everything worked as it should.
./starter_linux_ocp.sh
Make sure to create /etc/agentid with credentials before starting this script
Make sure your starter file points to the same directory that you have it in - Edit this starter file accordingly
In this case, this script is set to run the agent from a directory called /otm
You could add this to /etc/rc.local to start it or build a service for it. Please search Google for more.
Receiving binary location
Getting the OTM Package
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 150k 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
Downloading OTM
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 150k 100 150k 0 0 299k 0 --:--:-- --:--:-- --:--:-- 300k
OTM show be downloaded and if so, we're starting it and exiting this script
ps aux | grep otm
root 19796 0.0 0.0 766856 1888 pts/0 Sl 19:17 0:00 /otm/otm_linux
root 19858 0.0 0.0 112712 964 pts/0 S+ 19:17 0:00 grep --color=auto otm
-
BTW, if you create a new agent, you'll see that we've updated the script.
It now contains a variable at the top that you set called $start.
This is the path where you want to run your starter script from.
Once set, the script will use this and you do not have to edit any other part of the starter script.
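To illustrate the idea (this is not the actual starter script, just a sketch of the pattern), every path is built from the one variable. The helper name `otm_path` is made up, /home/otm is the example directory from earlier in this thread, and the file names are the ones that appear in the output above.

```shell
#!/bin/sh
# Sketch only: the real starter script may name things differently.
start="/home/otm"                 # the one value you edit

# Hypothetical helper: build a full path from $start.
otm_path() {
    printf '%s/%s\n' "$start" "$1"
}

log_file=$(otm_path otminfo.log)  # file name taken from the curl error above
binary=$(otm_path otm_linux)      # binary name taken from the ps output above

echo "log:    $log_file"
echo "binary: $binary"
```

With this layout, moving the agent to another directory means changing only the `start=` line.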
-
Thanks for the very quick response and update!
I followed the instructions and used start="/home/dadak/otm"
It shows as active now, but there were two errors during the installation, so I don't know if all is fine:
dadak@dadak-Inspiron-15-7569:~$ cd otm
dadak@dadak-Inspiron-15-7569:~/otm$ ./starter_linux_ocp.sh
./starter_linux_ocp.sh: line 8: /deviceid: Permission denied
[line 8: echo "$devid" >$filesetc/deviceid]
Make sure to create /etc/agentid with credentials before starting this script
Make sure you have set the 'start' path where this file is
You could add this to /etc/rc.local to start it or build a service for it. Please search Google for more.
Receiving binary location
Getting the OTM Package
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 150k 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
Downloading OTM
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 150k 100 150k 0 0 44716 0 0:00:03 0:00:03 --:--:-- 44703
OTM show be downloaded and if so, we're starting it and exiting this script
dadak@dadak-Inspiron-15-7569:~/otm$ socket(SOCK_RAW): Operation not permitted
-
And in case this is useful:
dadak@dadak-Inspiron-15-7569:~/otm$ ps aux | grep otm
dadak 19991 0.0 0.0 401576 3584 pts/1 Sl Apr28 0:00 /home/dadak/otm/otm_linux
dadak 20841 0.0 0.0 15620 980 pts/1 S+ 00:08 0:00 grep --color=auto otm
Should it run as root? If so how should I stop the current session and restart it as root.
How to remove the first incomplete agent installation from the Manage Agents page?
-
Great, you've got it running.
./starter_linux_ocp.sh: line 8: /deviceid: Permission denied
This is fine, it's just something in our own version that checks a second file for a hardware id. I'll see if we can remove that.
OTM show be downloaded and if so, we're starting it and exiting this script
dadak@dadak-Inspiron-15-7569:~/otm$ socket(SOCK_RAW): Operation not permitted
These errors are because the user that is running otm does not have full permissions needed.
You would have to run it as root or a user that has more permissions.
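For background, opening a raw socket on Linux normally needs root (or the CAP_NET_RAW capability), which matches the socket(SOCK_RAW) message above. A script could check for root before starting; the following is a minimal sketch, and `is_root_uid` is a made-up helper, not something from the OutagesIO starter script.

```shell
#!/bin/sh
# Hypothetical helper: succeed only when the given user ID is root (0).
# It does not look at capabilities; it only covers the plain
# "run it as root" case discussed in this thread.
is_root_uid() {
    [ "$1" -eq 0 ]
}

if is_root_uid "$(id -u)"; then
    echo "running as root: raw sockets should be permitted"
else
    echo "not root: expect 'socket(SOCK_RAW): Operation not permitted'" >&2
fi
```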
Should it run as root? If so how should I stop the current session and restart it as root.
As you did here:
dadak@dadak-Inspiron-15-7569:~/otm$ ps aux | grep otm
dadak 19991 0.0 0.0 401576 3584 pts/1 Sl Apr28 0:00 /home/dadak/otm/otm_linux
The process number (PID) is shown in the second column, right after the user name, in this case 19991. Just run 'kill PID' and that will stop it.
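If you would rather not read the PID off by eye, something like the following could pull it out of the `ps aux` output. It is only a sketch: `otm_pids` is a made-up helper, and it assumes the usual `ps aux` layout where the PID is column 2 and the command starts at column 11.

```shell
#!/bin/sh
# Hypothetical helper: read `ps aux` output on stdin and print the PID
# (column 2) of every process whose command (column 11) is otm_linux,
# with or without a leading directory.
otm_pids() {
    awk '$11 ~ /(^|\/)otm_linux$/ { print $2 }'
}

# Example: stop whatever is running, then start again as root.
# for pid in $(ps aux | otm_pids); do kill "$pid"; done
# sudo ./starter_linux_ocp.sh
```

Matching on the command column means the `grep` line in the output above is not picked up.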
It would be best to turn this into a service that auto starts/stops/restarts, which I'm sure folks in your operating system's forums, or a Google search, could help with. It is beyond our scope of support.
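As a rough starting point only, on a distribution that uses systemd a unit file could look like the one printed by the sketch below. Everything here is an assumption rather than something OutagesIO provides: the helper name `otm_unit`, the service name otm.service, and in particular the idea that otm_linux stays in the foreground when started directly (if it backgrounds itself, `Type=simple` would not be the right choice).

```shell
#!/bin/sh
# Hypothetical helper: print a minimal systemd unit for the agent.
# Assumptions: the binary lives at $1/otm_linux, stays in the foreground,
# and /etc/agentid already exists. None of this is confirmed by OutagesIO.
otm_unit() {
    cat <<EOF
[Unit]
Description=OutagesIO agent (unofficial sketch)
Wants=network-online.target
After=network-online.target

[Service]
Type=simple
User=root
WorkingDirectory=$1
ExecStart=$1/otm_linux
Restart=on-failure
RestartSec=30

[Install]
WantedBy=multi-user.target
EOF
}

# Example (as root); otm.service is an arbitrary name:
# otm_unit /home/dadak/otm > /etc/systemd/system/otm.service
# systemctl daemon-reload
# systemctl enable --now otm.service
```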
Eventually, we'll have more people on staff helping in the forums and questions like these could be answered here.
How to remove the first incomplete agent installation from the Manage Agents page?
Log into app.outagesio.com, pick the agent you want to administer if you have more than one.
Go to that agent's dashboard, click on Manage.
There, you'll see a 'Delete' option. Click on this and you will be able to remove this agent from your account.
Glad you got this working. Let me know if you need any more help.
If you look in the forums, you'll find some explainer videos and posts that explain some of the features. We're working hard on updating and adding those. We used to use an FAQ, which was challenging to maintain, and we are slowly moving everything into these forums.
-
Great, you've got it running.
Killed the process and restarted it with sudo. The first error message changed (but I understood that it's nothing to worry about) and the second error disappeared. Also, in the control panel, the 'not receiving hops' message is gone now. The dashboard is not showing any details yet. The up and down times continued from where they started last night.
./starter_linux_ocp.sh: line 8: /deviceid: Permission denied
Is now:
./starter_linux_ocp.sh: line 8: $files: ambiguous redirect
It would be best to turn this into a service that auto starts/stops/restarts, which I'm sure folks in your operating system's forums, or a Google search, could help with. It is beyond our scope of support.
I should be able to figure this out. I have been a Linux user for 20 years now, though mainly as a work system and only going under the hood when needed. Interesting though!
How to remove the first incomplete agent installation from the Manage Agents page?
Easy with your explanation :-) Though I would not have thought of looking there. A delete button in the overview page (with the other actions) may be more intuitive, though also easier to press by mistake.
Glad you got this working. Let me know if you need any more help.
Thanks so much for the quick and detailed replies. Really appreciated!
All the best with the project.
I hope to use it to fight with my provider who keeps 'fixing' it but it keeps going on and off. Since last night it shows it was not working 20% of the time! Especially in the current situation where I need to host and attend many Zoom meetings, a reliable connection is needed more than before.
-
Nice, you got it going.
The line 8 error is interesting. I'm not sure why it would show an error rather than file not found or something like that. When I run it on CentOS, I don't see the error and otm just starts. Maybe another OS interprets that a bit differently since it ends up being an empty variable. Maybe comment out that devid line.
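Both messages are consistent with an empty variable in the redirect on line 8. With `$filesetc` empty, `>$filesetc/deviceid` becomes `>/deviceid`, a file in the filesystem root that a normal user cannot create, and bash reports "ambiguous redirect" when a redirect target expands to nothing at all. A guard like the one below would avoid both cases; it is a sketch only, and `write_deviceid` is a made-up helper, not part of the starter script.

```shell
#!/bin/sh
# Hypothetical helper: write a device id only when the directory variable
# is non-empty, instead of redirecting to an unintended path.
write_deviceid() {
    # $1 = device id, $2 = directory (may be empty)
    if [ -z "$2" ]; then
        echo "skipping deviceid: directory variable is empty" >&2
        return 0
    fi
    printf '%s\n' "$1" > "$2/deviceid"
}

filesetc=""    # empty, as the error messages suggest
echo "target would be: '$filesetc/deviceid'"   # shows '/deviceid'
write_deviceid "abc123" "$filesetc"            # skipped, nothing is written
```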
As more people start using the forums, I'm sure people will share their scripts and tips on how they have accomplished things. We just fired these up recently and are trying to wean support and other questions over to here.
Easy with your explanation :-) Though I would not have thought of looking there. A delete button in the overview page (with the other actions) may be more intuitive, though also easier to press by mistake.
Yes, we keep trying different things to get rid of extra menus and duplicate things. It just takes time and input like yours to come up with the best ways that seem to work for many.
The dashboard is not showing any details yet. The up and down times continued from where they started last night.
This might help in terms of statuses and what you see.
About: Internet outages, alerts and agent statuses
Depending on the connection, cable provider or wireless for example, when there are a bunch of Inactive but no outages, it usually implies that in those cases at least, those were not Internet outages but something else. For example, wireless or modem signal problems, a bad cable, or something else that keeps preventing the agent (and probably other things on the same network) from reaching the Internet.
Keep us posted, we love hearing how people are using the service.
We especially love stories that show how OutagesIO was actually used to find facts and prove the problems.
-
-
Hello,
I am not sure what I am doing wrong, but for the past 2-3 days the computer has been on most of the time and only four short outages show, all from the first day. Of course the agent has been running all the time and shows as active on the website. In reality the internet was running intermittently and sometimes not working for hours at a time. (not on the computer, not on the phone and the internet light on the modem was off, it just kept trying to connect) I must be doing something wrong.
There are also no speed test updates since 30 April.
Any hints would be appreciated :-)
Dada
-
Did you receive the below post?
Everything seems to be running but outages are not recorded…
Hello,
I am not sure what I am doing wrong, but for the past 2-3 days the computer has been on most of the time and only four short outages show, all from the first day. Of course the agent has been running all the time and shows as active on the website. In reality the internet was running intermittently and sometimes not working for hours at a time. (not working on the computer, nor on the phone and the internet light on the modem was off, it just kept trying to connect) I must be doing something wrong.
There are also no speed test updates since 30 April.
Any hints would be appreciated :-)
Dada
-
Hi, sorry, I had not seen this. Kind of new to these forums and didn't realize there was a second page. When I logged in, I'd see the same last post which tells me I've missed a bunch of things and have to look into that next.
Please take a look at this:
About: Internet outages, alerts and agent statuses
This explains what the different statuses mean.
From what I'm reading, while you've experienced some outages, the rest of the problems do not seem to be IP outages.
The agent will always report IP outages so long as it is running and was not turned off during an outage.
If you get disconnections but no outages, this means that the problem is not IP outages but something else is causing Internet loss.
Now things get a little more complicated but it's what you don't see that tells the story.
The first question is how your Internet is provided. Is it by a wireless provider or a cable provider? If it is either of these two forms then you have signal levels at play. Signal levels cannot be monitored by OutagesIO though it is in our list of things to tackle.
If you see many inactive without outages, you are probably experiencing signal level losses and in turn, Internet connectivity issues.
If you don't think this is the problem after checking signal levels, the next thing to consider, taking into account that all other devices are failing too, is that something like a cable is going bad, a power supply is giving out, or a switch is failing. The problem is either right near the router/modem or it is the router/modem itself. If you have a router/firewall just before your provider's device, for example, it could be that too.
If you do have a router/firewall just before your provider's device, connect a laptop directly to the provider's device and see if the problems persist.
You'll have to carefully check and test everything.
I apologize for not seeing your question earlier. Now I need to check others that I've missed.
-
Hello!
Thanks, I was wondering what happened as your earlier replies were super fast.
I don't fully understand why it would not record the outages. I just have a DSL connection over the land line. The only wireless part is the Wi-Fi after the modem. That part keeps working fine and the internal network is up.
The lights on the modem clearly show when it's connected with the internet and when not. The outages website does not seem to capture it. I would think if there is no internet connection the client can't connect with the outages host and it would record it as not being connected.
The main reason for running the client is to be able to show Telmex that over half the time the modem is not connecting.
Hope you have some idea
-
Hi,
If you are not seeing outages in your reports, it means whatever the problems are that you are experiencing are not Internet/IP related. Or at least, they are not IP related in any way that is under your control.
The lights on the modem clearly show when it's connected with the internet and when not.
I don't know which modem you have but modems are also signal based and DSL can be prone to interference and signal level issues.
When you say it shows the Internet is connected, what light are you looking at? What does the manual say about it and its meaning?
What I'm saying is that we need to know very specifically what this light means on this modem. It may or may not actually be the 'Internet' that is up when this light is on. It might just mean that it has contact with the DSLAM multiplexer in your neighborhood where all of your neighbors' DSL connections go to. Meaning, it might only mean an electrical connection and not specifically Internet access.
The outages website does not seem to capture it.
I would think if there is no internet connection the client can't connect with the outages host and it would record it as not being connected.
Correct, that is the purpose of OutagesIO but if there is no actual IP outage then there is nothing to report. The agent will always report an outage if there was one. An outage is specifically related to IP only. OutagesIO cannot determine anything else but not seeing outages does tell you that the problem is not necessarily IP related.
This short article might help:
The nature of TCP/IP and Internet connectivity
I'll take a closer look at your reports if you've shared the agent ID in this thread. If it's not here, please share it.
You also mentioned a speed test problem but then seem to say that this is resolved now. Can you confirm?
-
Hello,
I think my agent ID is 31591.
The thing is that the first (and only) 5 events recorded (in April) kind of represent what is happening. The internet connection goes off for seconds, minutes and often many hours. Nothing at all was recorded after these first events. Also the speed test is not updating.
The modem has a light for "power", "internet", "DSL / WAN" and, beside the LAN ports, a "WPS/WLAN".
When power, internet, DSL and WLAN are on, the internet is working without trouble. The internet light indicates that the modem is able to ping/connect with the provider's site, which normally means you can connect anywhere on the internet.
When the internet light is off, the internet connection is off. In that case the DSL light comes on intermittently indicating that it's trying to establish a link.
The main reason the connection is not good is indeed signal strength. However, sometimes the signal is just enough to lock on and the Internet light indicates that the modem is able to connect (ping?) successfully with the internet provider; at other times it just keeps blinking for hours till it catches on again. The technicians know we need new cables and an exchange box closer to our house. They just do not seem to be in a hurry to do it (for 6 months already!). If I can show how bad it really is I may get a higher priority.
As a test, if I unplug the DSL cable, should OutagesIO record it as a disconnection? It does not. Though there are regular small interruptions visible in the heart-beat, they are also not showing in the Outages bars.
Hope this can help :-)
-
I'm taking a closer look.
Can you restart the agent please. It's been running for 3 days and I see that the last speed test was on the 9th which is when it was last restarted.
I think the reason the speed tests aren't coming in is because your bandwidth is below even 1Mbps. Our test runs at 1Mbps minimum which might explain what is going on there. We have been testing code that can report speeds below 1Mbps but it's not released yet.
I can pretty confidently state that the agent would log IP related outages if that is what was going on. It typically will not show an outage if you disconnect the local router because it needs at least one hop beyond the first hop or gateway. Meaning, if you installed the agent so that there is the gateway and then another hop, it would see that next hop go down if it did.
Note also that looking at your reports shows a bunch of 10.x.x.x IPs/hops. It looks like your provider's modem is your gateway and then the ISP network, all using the 10.x.x.x network. That makes it a bit hard to decipher what is what.
The setup makes me think your provider is a small one reselling the larger one. Not sure yet. Too bad you can't get a neighbor to use OutagesIO too. Then we could figure out what belongs where.
Let me stare at it some more.
-
@Alex:
I'm taking a closer look.
Can you restart the agent please. It's been running for 3 days and I see that the last speed test was on the 9th which is when it was last restarted.
Ok, I restarted it. Actually I saw there were two otm_linux processes running. I killed both and restarted. Now one process is running.
I think the reason the speed tests aren't coming in is because your bandwidth is below even 1Mbps. Our test runs at 1Mbps minimum which might explain what is going on there. We have been testing code that can report speeds below 1Mbps but it's not released yet.
Sometimes that may be true but most of the time, when there is a connection, it works for Zoom, YouTube on HD so I would think that is more than 1Mbps.
I can pretty confidently state that the agent would log IP related outages if that is what was going on. It typically will not show an outage if you disconnect the local router because it needs at least one hop beyond the first hop or gateway. Meaning, if you installed the agent so that there is the gateway and then another hop, it would see that next hop go down if it did.
Sorry this is a bit too technical for me. I only know that when I first installed it last month it did record 5 outages in the first day. The same keeps going on but now not recorded.
Note also that looking at your reports shows a bunch of 10.x.x.x IPs/hops. It looks like your provider's modem is your gateway and then the ISP network, all using the 10.x.x.x network. That makes it a bit hard to decipher what is what.
I am connected with Private Internet Access VPN. But while it was initially recording the outages (with or without the VPN), now it has not been recording, again with or without the VPN running.
The setup makes me think your provider is a small one reselling the larger one. Not sure yet.
I am using Telmex which is the main provider for the whole of Mexico.
Too bad you can't get a neighbor to use OutagesIO too. Then we could figure out what belongs where.
That will be difficult. I don't know who has internet here and my Spanish is not enough to be able to explain…
Let me stare at it some more.
-
Two processes? Yes, you definitely only want one running.
The speed test runs every hour for example. When I looked at your reports, I saw results that were below 1Mbps.
In terms of the provider, I am making assumptions, trying to guess, to try and understand your setup and situation. Even if this is a large provider, it 'appears' as if there might be a smaller in-between because of the range of IPs I'm seeing. Again, all assumptions.
VPN or no VPN, you are still connecting using IP and that is what the agent is testing. It cannot test or know anything but IP. It does not monitor packets or DNS or how you use your Internet, it only tests the connection from source to a destination. If you are not seeing outages, it is because there are no IP outages causing your problems.
That said, anything is possible. Without neighbors monitoring, it is very difficult to go deep into your situation because there is just not enough information. Maybe you have a friend that is technical and understands IP a little more that could help you? That person's input might give me more information to help you with.
-
I've been keeping an eye out on this and there isn't enough information to know anything yet.
Maybe you can share a little more info.
For example, the provider's router/modem, is this the only device on your network that you connect to? Meaning, you do not have a switch or another router before theirs? Your Internet connections are directly from their device?
I ask because I see that your hop one which appears to be your gateway has used four different private network segments.
10.60.10.1, 10.31.1.1, 10.34.10.1 and 10.38.1.1. After that, it goes to a public IP.
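All four of those addresses are in 10.0.0.0/8, one of the private (RFC 1918) ranges. If you want to check addresses from your own traceroute output, a small test like this would do; it is a sketch, `is_private_ip` is a made-up helper, and it does not validate that the input is a well-formed IPv4 address.

```shell
#!/bin/sh
# Hypothetical helper: succeed if an IPv4 address is in an RFC 1918
# private range (10.0.0.0/8, 172.16.0.0/12 or 192.168.0.0/16).
is_private_ip() {
    case "$1" in
        10.*) return 0 ;;
        192.168.*) return 0 ;;
        172.1[6-9].*|172.2[0-9].*|172.3[0-1].*) return 0 ;;
        *) return 1 ;;
    esac
}

for ip in 10.60.10.1 10.31.1.1 10.34.10.1 10.38.1.1; do
    if is_private_ip "$ip"; then
        echo "$ip is private"
    fi
done
```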
In this case, things are not obvious and this happens now and then. I'm sorry it's happening to you and I wish OutagesIO could help you but there is just not enough information and some things are not quite usual.