Book Review: Data-Driven Security: Analysis, Visualization and Dashboards

Book Review: Data-Driven Security: Analysis, Visualization and Dashboards 26

Posted by samzenpus on Monday July 07, 2014 @02:24PM from the read-all-about-it dept.

benrothke writes There is a not so fine line between data dashboards and other information displays that provide pretty but otherwise useless and unactionable information; and those that provide effective answers to key questions. Data-Driven Security: Analysis, Visualization and Dashboards is all about the later. In this extremely valuable book, authors Jay Jacobs and Bob Rudis show you how to find security patterns in your data logs and extract enough information from it to create effective information security countermeasures. By using data correctly and truly understanding what that data means, the authors show how you can achieve much greater levels of security. Keep reading for the rest of Ben's review.

Data-Driven Security: Analysis, Visualization and Dashboards
author	Jay Jacobs and Bob Rudis
pages	352
publisher	Wiley
rating	10/10
reviewer	Ben Rothke
ISBN	978-1118793725
summary	Superb book for effective use of data for information security

The book is meant for a serious reader who is willing to put in the time and effort to learn the programming necessary (mainly in Python and R) to truly understand what information exists deep in the recesses of their logs. As to R, it is a GNU project and a free software programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. For analysis the level of which Jacobs and Rudis prescribe, R is a godsend.

After completing the book, the reader will have the ability to know which questions to ask to gain security insights, and use that data to ensure the overall security of their data and networks. Getting to that level is not a trivial at all a trivial task; even if there are vendors who can promise to do that.

For many people performing data analysis, the dependable Excel spreadsheet is their basic choice for data manipulation. The book calls the spreadsheet a gateway tool between a text editor and programming. The book notes that spreadsheets work as long as the data is not too large or complex. The book quotes a 2013 report to shareholders from J.P. Morgan in which parts of their 2012 $6 billion in losses was due in part to problems with their Excel spreadsheets.

The authors suggest using Excel as a temporary solution for quick one-shot tasks. For those that have repeating analytical tasks or models that are used repeatedly, it's best to move to some type of structured programming language, specifically those that the book suggest and for provides significant amounts of code examples; all of which are available on the companion website here.

The goal of all data extraction is to use data analysis to answer real questions. A large part of the book focuses on how to ask the right question. In chapter 1, the authors write that every good data analysis project begins with setting a goal and creating one or more research questions. Without a well-formed question guiding the analysis, you may wasting time and energy seeking convenient answers in the data, or worse, you may end up answering a question that nobody was asking in the first place.

The value of the book is that it shows the reader how to focus on context and purpose of the data analysis by setting the research question appropriately; rather than simply parsing large amounts of data. It's ultimately irrelevant if you can use Hadoop to process petabytes of data if you don't know what you are looking for.

Visualization is a large part of what this book is about, and in chapter 6 — Visualizing Security Data, the book notes that the most efficient path to human understanding is via the visual sense. It goes on to details the many advantages data visualization has, and the key to making it work.

As important as visualization is, describing the data is equally important. In chapter 7, the book introduces the VERIS(Vocabulary for Event Recording and Incident Sharing) framework. VERIS is a set of metrics designed to provide a common language for describing security incidents in a structured and repeatable manner. VERIS helps organizations collect useful incident-related information and to share that information, anonymously and responsibly with others.

The book shows how you can use dashboards for effective data visualization. But the authors warn that a dashboard is not an art show. They caution that given the graphical nature of dashboards, it's easy to fall into the trap of making them look like pieces of modern or fringe art; when they are far more akin to architectural and industrial diagrams that require more controlled, deliberate and constrained design.

As to dashboards the authors do not like, they consider the Cyber Security Situational Awarenessto be glitzy but not informative. Personally, I thought the dashboard has a lot of good information.

The book uses the definition of dashboard according to Stephen Few, in that it's a "visual display of the most important information needed to achieve one or more objectives that has been consolidated in a single computer screen so it can be monitored at a glance". The book enables the reader to create dashboards like that.

Data-Driven Security: Analysis, Visualization and Dashboards is a superb book written by two experts who provide significant amounts of valuable information in every chapter. For those that are willing to put the time and effort into the serious amount of work that the book requires, they will find it a vital resource that will certainly help them achieve much higher levels of security.

Reviewed by Ben Rothke.

You can purchase Data-Driven Security: Analysis, Visualization and Dashboards from amazon.com. Slashdot welcomes readers' book reviews (sci-fi included) -- to see your own review here, read the book review guidelines, then visit the submission page. If you'd like to see what books we have available from our review library please let us know.

Book Review: Data-Driven Security: Analysis, Visualization and Dashboards

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 26 Comments Log In/Create an Account

Comments Filter:

- Re:Question, what does R do that other lingos cann (Score:4, Informative)
  
  by vux984 ( 928602 ) writes: on Monday July 07, 2014 @02:58PM (#47401851)
  
  Question, what does R do that other lingos cannot?
  Nothing. I'm sure other languages can do everything R can do.
  Does it just have statistical functions built in and ready to go?
  It does have that, along with an active community and growing popularity in scientific circles, so there is lots cutting edge interesting work being done with R -- and a lot of its free and open source. Plus it has multi-core support in several libraries places, and even gpu support in some.
  
  - Re: (Score:1)
    
    by majid_aldo ( 812530 ) writes:
    
    Question, what does R do that other lingos cannot?
    Nothing. I'm sure other languages can do everything R can do.
    Does it just have statistical functions built in and ready to go?
    It does have that, along with an active community and growing popularity in scientific circles, so there is lots cutting edge interesting work being done with R -- and a lot of its free and open source. Plus it has multi-core support in several libraries places, and even gpu support in some.
    since it has cutting-edge stat functions that's plenty of functionality that R has that other languages DON'T have.
    - Re: (Score:2)
      
      by vux984 ( 928602 ) writes:
      
      since it has cutting-edge stat functions that's plenty of functionality that R has that other languages DON'T have.
      MATLAB, Python and other languages have stuff in the same class as R. R is particularly well suited for stats functionality... but its is not UNIQUELY suited for it.
  - true. All languages can do exactly the same things (Score:2)
    
    by raymorris ( 2726007 ) writes:
    
    Question, what does R do that other lingos cannot?
    Nothing. I'm sure other languages can do everything R can do.
    This is an interesting point, which I'm going to veer slightly off topic with. All general purpose programming languages* can do _precisely_ the same things. All fit the requirements to be "Turing complete". ANY Turing complete language "A" can emulate any other Turing complete language "B", and therefore "A" can do the anything that "B" can do. Since "B" can also emulate "A", the two languages can do precisely the same things. (Church-Turing thesis). An interesting example of this is that JavaScript can
    - Re: (Score:2)
      
      by vux984 ( 928602 ) writes:
      
      All general purpose programming languages* can do _precisely_ the same things.
      For a rather broad and mathematically abstract definition of "precisely".
      The Church-Turing thesis applies to computers and computation in the abstract. Actual computer languages on actual hardware may theoretically be able to do the same things in an abstract sense, but not necessarily do precisely the same things with the actual physical hardware they run on.
      Not necessarily due to the language itself, but the nature of how they a
      - Exmpl? If an interpreter for A can be written in B (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        If an interpreter for language A can be written in language B, then B can therefore do everything A does, by running that interpreter. Do you have an example in mind of two languages that can do very different things?
        
        Re: (Score:2)
        
        by vux984 ( 928602 ) writes:
        
        If an interpreter for language A can be written in language B, then B can therefore do everything A does, by running that interpreter.
        Mathematically speaking yes. Practically speaking no.
        Do you have an example in mind of two languages that can do very different things?
        Postscript is Turing complete. Now go write an interpreter for C / C++ with it, and use it to play Call of Duty.
        You can write an interpreter for C/C++ with it.
        Hypothetically speaking it would compile and run the source code for Call of Duty.
        Pr
        
        sure it does. If you sandbox J, it's sandboxed too (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        If you sandbox Java in the browser, or sandbox a plugin written in C, it can't access DirectX either. The fact that people often choose to run a program in a sandbox doesn't mean anything about the language(s) the program is written in. Try writing a C compiler in C. It's not easy in any language. It's possible in any.
        
        ps - I wouldn't want to write COD in Postscript (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        Ps, it would certainly be EASIER to write Call of Duty in some languages than it would in others. It would be difficult to get it to run QUICKLY in some languages (actually that's true of all languages). It could be done, though, and that's point. The question isn't what CAN the language do, the question is what it's best suited for. Just because you CAN write a pixel shader in Perl doesn't mean you should.
        
        Re: (Score:2)
        
        by vux984 ( 928602 ) writes:
        
        The fact that people often choose to run a program in a sandbox doesn't mean anything about the language(s) the program is written in.
        The fact that you can theoretically put any language into a given sandbox or theoretically take it out of one is not equivalent to a real ability to actually do it in the real world today.
        It's not easy in any language. It's possible in any.
        Imagine a turing complete toy language which only operates on binary values. The only implementation of that language allocates a byte
        
        Interesting, but not a Turing machine, unless is (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        We're way off in the weeds here, of course, but that's cool. I don't mind playing in the weeds.
        What you've done there is analogous to Dear Leader's argument "it's Constitutional because it is not a tax and is a tax". You've tried to say "it can write the single value 00000001, which is eight values". Either that's one value or eight, pick one.
        The definition of a Turing machine has requires very few capabilities. One of the very few things required by the definition of a Turing machine is that is has to be
        
        Re: (Score:2)
        
        by vux984 ( 928602 ) writes:
        
        We're way off in the weeds here, of course, but that's cool. I don't mind playing in the weeds.
        Way out there. :)
        You've defined a language that can only update eight bits at a time, and additionally you've said it updates them only in certain patterns. That's not Turing complete.
        No you are mistaken.
        From the point of view of the LANGUAGE, each bit is individually and directly accessible. All the language sees is
        0, 1, 0, 1, 0 ...
        The implementation of the language however, runs on x86, and uses a byte to repres
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  http://en.wikipedia.org/wiki/R_%28programming_language%29
  R is a free software programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software[2][3] and data analysis.[3] Polls and surveys of data miners are showing R's popularity has increased substantially in recent years.[4][5][6]
  R is an implementation of the S programming language combined with lexical scoping semantics inspired by
Basically Bullshit (Score:1)

by gweihir ( 88907 ) writes:

This may have some use against script-kiddies, bot-nets and similarly non-sophisticated adversaries. It is worse than nothing against other adversaries, as it creates a false sense of security.
- Re: (Score:2)
  
  by strikethree ( 811449 ) writes:
  
  Hm. I am going to have to disagree with you there. A false sense of security can be gleaned from such data; however, a false sense of security can be had from NO information at all. A false sense of security is a failing in the security practitioner, not a result of the data. For example, let's say someone has done this analysis and thinks they are secure and then reads your comment. They then know the limitations of what their data can expose and can continue to look for more subtle traces leading to disco

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Book Review: Data-Driven Security: Analysis, Visualization and Dashboards 26

Book Review: Data-Driven Security: Analysis, Visualization and Dashboards More Login

Book Review: Data-Driven Security: Analysis, Visualization and Dashboards

Re:Question, what does R do that other lingos cann (Score:4, Informative)

Re: (Score:1)

Re: (Score:2)

true. All languages can do exactly the same things (Score:2)

Re: (Score:2)

Exmpl? If an interpreter for A can be written in B (Score:2)

Re: (Score:2)

sure it does. If you sandbox J, it's sandboxed too (Score:2)

ps - I wouldn't want to write COD in Postscript (Score:2)

Re: (Score:2)

Interesting, but not a Turing machine, unless is (Score:2)

Re: (Score:2)

Re: (Score:1)

Basically Bullshit (Score:1)

Re: (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot