It is with great honor that we wrap up this series of Blogs with an interview with Neal Allen, TAC Level 3 Escalation Engineer at Fluke Networks, a long time veteran of InteropNet, both as a member of the NOC team and also a corporate sponsor.
Q: Neal, in this series of Blog, I try to trace back the steps of InteropNet and SpyNet and also the people who worked hard to make this event a reality. From the horse’s mouth, why did we need SpyNet to begin with? What was InteropNet like 10 years ago? I am going to truncate it to 10 years, if you don’t mind. I know you go back further than that.
A: In a twisted sense, the “old network” was much more fun. Yes, we were sometimes in the convention center for upwards of 20 hours per day for many days before the show opened, but it was so cutting edge we are still bleeding years later. There were times that the compile time stamp (not date) in the router code was some fifteen minutes before the show opened. It was an exciting time. The name of the show “Interop” was selected to portray the interoperability possible with the then generation of technology, which often didn’t play well with others. To accomplish this we usually had the development engineers from the various router and switch vendors sitting side by side with development platforms compiling new code all week to force the switches and routers to work together. Imagine the development engineers from 3COM, Wellfleet, Proteon, etc., all elbow-to-elbow trying to discover why the frame didn’t make it. This show single-handedly contributed significantly to the possibility for multi-vendor networks to operate.
But back to the Spy Network …
Because the show is distributed fairly widely over the convention center, and because we don’t have the luxury of telling the user to “just wait a few minutes”, we instituted the Spy Network to permit the NOC staff to have a media-level link to any part of the network. This was deployed using a parallel cable plant to avoid single-points-of-failure. In Telco terms, we didn’t have to roll a truck. This permitted us to see MAC layer errors and traffic from anywhere without leaving our comfy chairs in the NOC. At the time this was deployed it was pretty revolutionary. These days people set up management VLANs to accomplish similar things, but that does not provide the same level of access, and it goes through the same trunk links as the user traffic.
Of course, due to the nature of the network, there were as many as sixteen parallel networks to deal with, so Spy was in constant use and the biggest problem was determining which protocol was coming out of the link so that you connected the right analysis tool. We used the latest bleeding edge technology as the primary network transport, but always had one or two tried-and-true backup technologies just in case.
Q: That’s great stuff. So over the last ten years, what happened to networking and what happened to network troubleshooting and monitoring?
A: When I first started with the Interop network it could be described as large broadcast domains separated by a few routed connections. That made it very easy to diagnose with a traditional protocol analyzer. Today’s network is entirely switched, and is exceedingly difficult to monitor and troubleshoot. It isn’t good enough to monitor an uplink any longer. As with a typical corporate network we are now faced with having to determine where to start. Only we have a few Alpha or Beta products in exhibitor booths which may or may not be working correctly at this first unveiling, which the normal corporate network usually does not experience.
* Is it the client data?
* Is it routing?
* Is it a loop or alternate path between a VPN to the corporate network and the normal path out of our network?
* Is it traffic from another exhibit?
* Is it some form of virus or Internet based attack?
* Is it the network protection (authentication or firewalls) within our network?
* Is it marginal or faulty cable? Or over saturated wireless…
* Is it a marginal or failed hardware element in our network?
* Is it a Beta product which does not yet operate correctly, but is being introduced at the show?
and so on.
The current trends toward converged networks, and toward having more intelligent switching products, I am seeing a huge need for the diagnostic tool vendors to be even more creative than the product vendors in inventing a few new methods for monitoring the network and for unearthing increasingly more exotic problems. I am not sure what direction that will take, but the need is evident.
Q: Now the big question. What’s next for Spynet and monitoring? What do you see as the big breakthrough going forward?
A: This show has recently been using tapping technologies which are as complex as the switch/routers being monitored. That is a large step in the right direction, but it isn’t enough. I think we will have to get “inside” the switch/router itself and interact with the configured intelligence of the box itself in order to make the next leap forward in monitoring. However, we will still need to see the resulting data stream in and out to both double-check on the box, and to offer adequate monitoring, alerting, and protection.
The Spy Network is needed as much as ever before, perhaps more so. Otherwise how will you know when the switch itself becomes confused? But it will be partnering with “inside the box” monitoring and diagnostics sometime soon.
Thanks, Neal. You have been an inspiration for all of us. And I am sure I will see you again in New York.
Denny K Miu
Gigamon Systems
Part 1: InteropNet - Tribal Customs and Best Practices
Part 2: History of SpyNet (Son of LAN-Hopper)
Part 3: Interop*Spy*Net
Part 4: SpyNet and Network Physics
Part 5: SpyNet and Internap
Part 6: SpyNet and Neal Allen