Results 1 to 8 of 8

Thread: Excel and large datasets - advice?

  1. #1
    jim
    jim is offline
    HEXUS.clueless jim's Avatar
    Join Date
    Sep 2008
    Location
    Location: Location:
    Posts
    11,457
    Thanks
    613
    Thanked
    1,645 times in 1,307 posts
    • jim's system
      • Motherboard:
      • Asus Maximus IV Gene-Z
      • CPU:
      • i5 2500K @ 4.5GHz
      • Memory:
      • 8GB Corsair Vengeance LP
      • Storage:
      • 1TB Sandisk SSD
      • Graphics card(s):
      • ASUS GTX 970
      • PSU:
      • Corsair AX650
      • Case:
      • Silverstone Fortress FT03
      • Operating System:
      • 8.1 Pro
      • Monitor(s):
      • Dell S2716DG
      • Internet:
      • 10 Mbps ADSL

    Excel and large datasets - advice?

    I've just inherited a woeful Excel doc (all 40MB of it) and am in the process of coming up with ideas to fix it, but I think, technically, it ought to be possible to improve on it by just not using Excel at all.

    The context is: there's a system which tracks all logons, and produces a monthly report with a list of UIDs and when they accessed the system. With a list of staff information (inc. UIDs), the excel doc then runs lookups to find all of the relevant staff info for each and every UID. Then there is a huge list of tables, over a thousand lines long, which show the percentages of people accessing the system by grade, by department, by location, and so on.

    As you can imagine, running that on several months' data, running to thousands and thousands of lines of lookups, it all becomes unbearably slow. What I'm in the process of doing is offloading the analysis to separate workbooks, so that each month is calculated separately, and then it's possible to just paste the values into a final overview doc, month by month (since the old data won't change).

    However, that only feels like half a solution to me - a stop-gap I guess. I'm thinking that Access might be a better solution, because surely you should be obtaining information about what percentage of people accessed the system from a particular office by running a report - not by having pre-emptively created a table with a very precise and bespoke formula to calculate it? Or is my logic flawed?

    If I can use Excel, then it's much easier for me, it just feels... wrong.

  2. #2
    Not a good person scaryjim's Avatar
    Join Date
    Jan 2009
    Location
    Gateshead
    Posts
    15,196
    Thanks
    1,231
    Thanked
    2,291 times in 1,874 posts
    • scaryjim's system
      • Motherboard:
      • Dell Inspiron
      • CPU:
      • Core i5 8250U
      • Memory:
      • 2x 4GB DDR4 2666
      • Storage:
      • 128GB M.2 SSD + 1TB HDD
      • Graphics card(s):
      • Radeon R5 230
      • PSU:
      • Battery/Dell brick
      • Case:
      • Dell Inspiron 5570
      • Operating System:
      • Windows 10
      • Monitor(s):
      • 15" 1080p laptop panel

    Re: Excel and large datasets - advice?

    If it's actual data, get it in Access ASAP. Then consider moving it to SQL Server

    OK, now I'll actually read your post ......

    Without seeing the actual data I can't be 100% sure, but it looks to me like a fairly trivial piece of work for an Access database. Seriously, learn it, get your hands dirty with some VBA to stick a nice dashboard on it, and management will love you Just be sure you *very carefully* document all the business logic in the reporting before you try to reimplement it.

    Then you can print nicely formatted reports, export data for use in other applications (even back into Excel if that's how management want it), whatever: handling that volume of data will be *way* easier in Access.

    p.s. if your employers want to hire an Access consultant for a day or two just to get you going in the right direction....

  3. #3
    Seething Cauldron of Hatred TheAnimus's Avatar
    Join Date
    Aug 2005
    Posts
    17,168
    Thanks
    803
    Thanked
    2,152 times in 1,408 posts

    Re: Excel and large datasets - advice?

    Ah I spent a few years fixing stuff like this.

    40MB isn't much. When you get above 400 it will be more 'fun'.

    For now, I'd have thought modern hardware and 2013 let this work nicely enough. The question is, how big will it have to be in the future?
    throw new ArgumentException (String, String, Exception)

  4. #4
    jim
    jim is offline
    HEXUS.clueless jim's Avatar
    Join Date
    Sep 2008
    Location
    Location: Location:
    Posts
    11,457
    Thanks
    613
    Thanked
    1,645 times in 1,307 posts
    • jim's system
      • Motherboard:
      • Asus Maximus IV Gene-Z
      • CPU:
      • i5 2500K @ 4.5GHz
      • Memory:
      • 8GB Corsair Vengeance LP
      • Storage:
      • 1TB Sandisk SSD
      • Graphics card(s):
      • ASUS GTX 970
      • PSU:
      • Corsair AX650
      • Case:
      • Silverstone Fortress FT03
      • Operating System:
      • 8.1 Pro
      • Monitor(s):
      • Dell S2716DG
      • Internet:
      • 10 Mbps ADSL

    Re: Excel and large datasets - advice?

    Quote Originally Posted by TheAnimus View Post
    Ah I spent a few years fixing stuff like this.

    40MB isn't much. When you get above 400 it will be more 'fun'.

    For now, I'd have thought modern hardware and 2013 let this work nicely enough. The question is, how big will it have to be in the future?
    I think if the 40MB was raw data it would probably be alright, but it's about 5% data and 95% index/matches, just sends the processor into overdrive. Full calc takes about 2 mins on my quad core laptop, on the older machines it just crashes them outright.

  5. #5
    Treasure Hunter extraordinaire herulach's Avatar
    Join Date
    Apr 2005
    Location
    Bolton
    Posts
    5,618
    Thanks
    18
    Thanked
    172 times in 159 posts
    • herulach's system
      • Motherboard:
      • MSI Z97 MPower
      • CPU:
      • i7 4790K
      • Memory:
      • 8GB Vengeance LP
      • Storage:
      • 1TB WD Blue + 250GB 840 EVo
      • Graphics card(s):
      • 2* Palit GTX 970 Jetstream
      • PSU:
      • EVGA Supernova G2 850W
      • Case:
      • CM HAF Stacker 935, 2*360 Rad WC Loop w/EK blocks.
      • Operating System:
      • Windows 8.1
      • Monitor(s):
      • Crossover 290HD & LG L1980Q
      • Internet:
      • 120mb Virgin Media

    Re: Excel and large datasets - advice?

    Access or some other form of database is obviously the right home for at least the lookups, If its anything like our place though you'll have issues getting the source data directly into it. Also, prepare to be the resident IT guy if you do do it and you're not already.

  6. #6
    Senior Member
    Join Date
    Jun 2013
    Posts
    225
    Thanks
    7
    Thanked
    10 times in 9 posts
    • Nipun's system
      • Motherboard:
      • Intel DH67BL
      • CPU:
      • Intel® Core™ i3-2100 Processor
      • Memory:
      • G-skill 4GB DDR 3
      • Storage:
      • 500GB sata HDD
      • Graphics card(s):
      • Intel HD 2000
      • Operating System:
      • Windows 7 32bit

    Re: Excel and large datasets - advice?

    You can use Octave , Matlab or R to do the data analysis .I prefer MATLAB over others
    if(lifeGivesYou == lemons){
    makeLemonade()
    }

  7. #7
    HEXUS.social member finlay666's Avatar
    Join Date
    Aug 2006
    Location
    Newcastle
    Posts
    8,546
    Thanks
    297
    Thanked
    894 times in 535 posts
    • finlay666's system
      • CPU:
      • 3570k
      • Memory:
      • 16gb
      • Graphics card(s):
      • 6950 2gb
      • Case:
      • Fractal R3
      • Operating System:
      • Windows 8
      • Monitor(s):
      • U2713HM and V222H
      • Internet:
      • cable

    Re: Excel and large datasets - advice?

    DB + Lightswitch
    H3XU5 Social FAQ
    Quote Originally Posted by tiggerai View Post
    I do like a bit of hot crumpet

  8. #8
    Member
    Join Date
    Aug 2006
    Posts
    198
    Thanks
    5
    Thanked
    14 times in 14 posts

    Re: Excel and large datasets - advice?

    Definitely a suitable candidate for a database - several source tables and multiple reports/outputs back to applications like Excel etc.

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •