• UP TO 5X FASTER
      SAS RUNTIMES...
      WITH FEWER CORES
    • SAS ANALYTICS

ACCELERATING SAS ANALYTICS WITH PARALLEL FILE SYSTEMS

“The GRIDScaler parallel file system is an excellent choice for SAS Grid deployments. IO intensive SAS Grid workloads have demonstrated excellent performance characteristics utilizing this storage appliance” —Cheryl Doninger, Senior Director, Research and Development, SAS

More than 50% of the largest oil and gas companies, 40% of the leading financial services companies, and 30% of the top aerospace and automotive companies have deployed DDN solutions to address performance, scalability, and TCO challenges in their organizations. Built from the ground, DDN’s GRIDScaler (IBM® Spectrum Scale™) and EXAScaler (Lustre* FS) parallel file system solutions are the next-generation analytics storage platforms that eliminate performance bottlenecks, simplify environment, and greatly increase return on investment.

Solution Advantages

  • 4.5X improvement in end-to-end workflow performance
  • 400% higher throughput per core
  • More than 4x SAS Grid workflows executed on DDN solutions vs. competing solutions in the same amount of time
  • More than 50% reduction in data center footprint
  • Elimination of data silos and simplification of data management infrastructure

BENEFITS

A formidable challenge for Big Data analytics is architecting a high-performance infrastructure to handle rapid and unpredictable data growth at a reasonable price point. Traditional NAS and SAN enterprise storage protocols (NFS, CIFS, iSCSI, FC) are designed more for traditional back-office applications and file-sharing repositories. These protocols are point-to-point and are not designed for concurrent access of data from multiple applications. Furthermore, the SAS Grid computing tool consolidates islands of analytics hardware, increasing the performance requirements for shared, backed storage beyond what traditional protocols can offer.

A parallel file system offers several advantages over a single, direct-attached file system or traditional network-attached storage. When DDN parallel file storage solutions are used in conjunction with SAS Analytics, the advantages include:

  1. A significant decrease in operational latency and high bandwidth data transfer, especially for data intensive and time-sensitive SAS Analytics workflows, by balancing content around multiple file system servers.
  2. Separation of data and metadata, enabling optimized performance and accessibility and delivering lower-latency file-system metadata access, thereby eliminating I/O bottlenecks.
  3. Linear scalability of bandwidth and I/O, which delivers significantly higher aggregate performance over traditional architectures that are extremely complex and expensive to scale up, often resulting in performance degradation.
  4. Parallelization of SAS Analytics in a single, shared namespace, allowing a user to treat any data-intensive workload independently from other data-intensive workloads because of efficient file-locking.
  5. Consolidation of data silos while delivering higher performance, scalability, and reliability architectures with no single point of failure, thereby allowing users to simplify data management, minimize datacenter footprint and licensing, and cut operations costs.