Page 192 of 253 FirstFirst ... 92142152162172182189190191192193194195202212222232242 ... LastLast
Results 3,057 to 3,072 of 4036

Thread: AMD - Piledriver chitchat

  1. #3057
    Moosing about! CAT-THE-FIFTH's Avatar
    Join Date
    Aug 2006
    Location
    Not here
    Posts
    32,042
    Thanks
    3,909
    Thanked
    5,213 times in 4,005 posts
    • CAT-THE-FIFTH's system
      • Motherboard:
      • Less E-PEEN
      • CPU:
      • Massive E-PEEN
      • Memory:
      • RGB E-PEEN
      • Storage:
      • Not in any order
      • Graphics card(s):
      • EVEN BIGGER E-PEEN
      • PSU:
      • OVERSIZED
      • Case:
      • UNDERSIZED
      • Operating System:
      • DOS 6.22
      • Monitor(s):
      • NOT USUALLY ON....WHEN I POST
      • Internet:
      • FUNCTIONAL

    Re: AMD - Piledriver chitchat

    One of the major limitations with audio encoding is actually the drive/drive interface used when encoding which many sites quietly bypass.



    I used an external USB2 DVD rewriter and iTunes is single threaded.

    I really would love to get my hands on a A8 7600 for example and test it in many games in IGP and Crossfire modes. Don't have any money for any major computer expenditures though ATM!!

  2. #3058
    Not a good person scaryjim's Avatar
    Join Date
    Jan 2009
    Location
    Gateshead
    Posts
    15,196
    Thanks
    1,232
    Thanked
    2,290 times in 1,873 posts
    • scaryjim's system
      • Motherboard:
      • Dell Inspiron
      • CPU:
      • Core i5 8250U
      • Memory:
      • 2x 4GB DDR4 2666
      • Storage:
      • 128GB M.2 SSD + 1TB HDD
      • Graphics card(s):
      • Radeon R5 230
      • PSU:
      • Battery/Dell brick
      • Case:
      • Dell Inspiron 5570
      • Operating System:
      • Windows 10
      • Monitor(s):
      • 15" 1080p laptop panel

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by Noxvayl View Post
    ... Is that going to give the results you are interested in scaryjim? (http://www.phoronix-test-suite.com/?k=home) ...
    Ugh, density of stuffs! A comprehensive test suite is a nice idea, but it'd take weeks to understand what all the tests mean...

    It looks like the only straight audio tests are encoding, which - while it's a part of what I do - is only a small part of my overall process. I might have to work on some tests of my own - I'd love to know where the limiting factor is on, for instance, applying a light vocal reverb to an hour of podcast! Or running compression over a similar length of file. I know those things take several minutes on the computers I've run them on, but I've still not worked out whether it's a CPU bottleneck, RAM bottleneck, IO bottleneck or what...

    I should probably do some more research really, I bet the answer is out there somewhere

  3. #3059
    Ninja Noxvayl's Avatar
    Join Date
    May 2007
    Location
    In the shadows
    Posts
    2,451
    Thanks
    748
    Thanked
    215 times in 173 posts
    • Noxvayl's system
      • Motherboard:
      • GigabyteZ87X-UD4H-CF
      • CPU:
      • Intel i7 4770K
      • Memory:
      • 16GB Corsair Vengaence LPX + 8GB Kingston HyperX Beast
      • Storage:
      • 120GB Snadisk + 256GB Crucial SSDs
      • Graphics card(s):
      • 4GB Sapphire R9 380
      • PSU:
      • ENermax Platimax 750W
      • Case:
      • Fractal Design Define S
      • Operating System:
      • Windows 10 64bit
      • Monitor(s):
      • ATMT + Dell 1024x1280
      • Internet:
      • Sky Fibre

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by scaryjim View Post
    Ugh, density of stuffs! A comprehensive test suite is a nice idea, but it'd take weeks to understand what all the tests mean...

    It looks like the only straight audio tests are encoding, which - while it's a part of what I do - is only a small part of my overall process. I might have to work on some tests of my own - I'd love to know where the limiting factor is on, for instance, applying a light vocal reverb to an hour of podcast! Or running compression over a similar length of file. I know those things take several minutes on the computers I've run them on, but I've still not worked out whether it's a CPU bottleneck, RAM bottleneck, IO bottleneck or what...

    I should probably do some more research really, I bet the answer is out there somewhere
    Tell you what, you do the program that will help me do the testing and then I'll give you all the data As I said earlier I know nothing in this regard, I am learning. I can run that suite and only give you the data from the one test of interest to you; that is if that would be helpful.

    I'll have access to this machine at all times from now until 20 September 2014; I'll also have access to my own machine, my parents machine and another family machine which are all desktops and I have all the details for the bits inside them because I built them. They vary a lot but hopefully you can tell where the bottleneck lies if I do some tests on those machines.

    I would enjoy doing it so if it would benefit you please do sort out that benchmarking program. I would also enjoy learning about the process, something that interests me but never got round to investigating.

  4. Received thanks from:

    scaryjim (08-08-2014)

  5. #3060
    root Member DanceswithUnix's Avatar
    Join Date
    Jan 2006
    Location
    In the middle of a core dump
    Posts
    13,010
    Thanks
    781
    Thanked
    1,568 times in 1,325 posts
    • DanceswithUnix's system
      • Motherboard:
      • Asus X470-PRO
      • CPU:
      • 5900X
      • Memory:
      • 32GB 3200MHz ECC
      • Storage:
      • 2TB Linux, 2TB Games (Win 10)
      • Graphics card(s):
      • Asus Strix RX Vega 56
      • PSU:
      • 650W Corsair TX
      • Case:
      • Antec 300
      • Operating System:
      • Fedora 39 + Win 10 Pro 64 (yuk)
      • Monitor(s):
      • Benq XL2730Z 1440p + Iiyama 27" 1440p
      • Internet:
      • Zen 900Mb/900Mb (CityFibre FttP)

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by scaryjim View Post
    Ugh, density of stuffs! A comprehensive test suite is a nice idea, but it'd take weeks to understand what all the tests mean...

    It looks like the only straight audio tests are encoding, which - while it's a part of what I do - is only a small part of my overall process. I might have to work on some tests of my own - I'd love to know where the limiting factor is on, for instance, applying a light vocal reverb to an hour of podcast! Or running compression over a similar length of file. I know those things take several minutes on the computers I've run them on, but I've still not worked out whether it's a CPU bottleneck, RAM bottleneck, IO bottleneck or what...

    I should probably do some more research really, I bet the answer is out there somewhere
    Yeah I am not a fan of the Phoronix way of testing, seems very undirected and random.

    I suppose it comes down to what package you use and how you use it. If it is something like Audacity which is free then benchmarking should be workable. Would want some realistic task for it to perform though, and an easily available source file.

  6. #3061
    Not a good person scaryjim's Avatar
    Join Date
    Jan 2009
    Location
    Gateshead
    Posts
    15,196
    Thanks
    1,232
    Thanked
    2,290 times in 1,873 posts
    • scaryjim's system
      • Motherboard:
      • Dell Inspiron
      • CPU:
      • Core i5 8250U
      • Memory:
      • 2x 4GB DDR4 2666
      • Storage:
      • 128GB M.2 SSD + 1TB HDD
      • Graphics card(s):
      • Radeon R5 230
      • PSU:
      • Battery/Dell brick
      • Case:
      • Dell Inspiron 5570
      • Operating System:
      • Windows 10
      • Monitor(s):
      • 15" 1080p laptop panel

    Re: AMD - Piledriver chitchat

    I can't help wondering if there's a way to script audacity to perform some common filtering tasks on a set sound file. One of the big things I'm currently doing is noise removal, compression and reverb on ~ 1hour of vocal recording. Each of those tasks takes several minutes to complete, so it'd be good to know where the bottlenecks are.

    I'll do some research and see if I can work something out

    EDIT: audacity has scripting support, but it's basic and experimental - I certainly won't be doing anything with it in the near future...
    Last edited by scaryjim; 08-08-2014 at 04:44 PM.

  7. #3062
    Senior Member
    Join Date
    Mar 2010
    Posts
    2,567
    Thanks
    39
    Thanked
    179 times in 134 posts

    Re: AMD - Piledriver chitchat


  8. Received thanks from:

    kalniel (11-08-2014)

  9. #3063
    Banhammer in peace PeterB kalniel's Avatar
    Join Date
    Aug 2005
    Posts
    31,039
    Thanks
    1,880
    Thanked
    3,379 times in 2,716 posts
    • kalniel's system
      • Motherboard:
      • Gigabyte Z390 Aorus Ultra
      • CPU:
      • Intel i9 9900k
      • Memory:
      • 32GB DDR4 3200 CL16
      • Storage:
      • 1TB Samsung 970Evo+ NVMe
      • Graphics card(s):
      • nVidia GTX 1060 6GB
      • PSU:
      • Seasonic 600W
      • Case:
      • Cooler Master HAF 912
      • Operating System:
      • Win 10 Pro x64
      • Monitor(s):
      • Dell S2721DGF
      • Internet:
      • rubbish

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by HalloweenJack View Post
    Though actually the review isn't of the retail one, but the OEM one the author picked up from a systems integrator sell off

  10. #3064
    Senior Member
    Join Date
    Mar 2010
    Posts
    2,567
    Thanks
    39
    Thanked
    179 times in 134 posts

    Re: AMD - Piledriver chitchat

    meh - didn't see that part , thought it was the retail kit they were using

  11. #3065
    Senior Member
    Join Date
    Mar 2010
    Posts
    2,567
    Thanks
    39
    Thanked
    179 times in 134 posts

  12. #3066
    Senior Member watercooled's Avatar
    Join Date
    Jan 2009
    Posts
    11,478
    Thanks
    1,541
    Thanked
    1,029 times in 872 posts

    Re: AMD - Piledriver chitchat

    Some more details about Denver are being revealed by Nvidia. One thing I keep seeing is big flashy references to 'Dynamic Code Optimisation'.
    http://wccftech.com/nvidias-64bit-de...ocked-250-ghz/

    Judging by how they describe it, hasn't that kinda been a standard feature set of instruction decoders for some time now? And, by 'binary translation', I wonder if they're just referring to uOps, which are pretty much ubiquitous in modern processors of this class?

    Edit: Also, I wonder what they mean by '7-wide' exactly? On the block diagram I see 7 execution ports, like A15 and Krait, but what is its decode width?

    Edit2: I've just looked back at some of the earlier marketing material from Nvidia - they compare A15 being '3-wide' to Denver being '7-wide'. So do they really have a 7-wide front-end to go along with the 7 execution ports, is it some architectural peculiarity to do with their 'binary translation', or have the marketing team just got it arse-about-face?
    Last edited by watercooled; 12-08-2014 at 06:26 PM.

  13. #3067
    root Member DanceswithUnix's Avatar
    Join Date
    Jan 2006
    Location
    In the middle of a core dump
    Posts
    13,010
    Thanks
    781
    Thanked
    1,568 times in 1,325 posts
    • DanceswithUnix's system
      • Motherboard:
      • Asus X470-PRO
      • CPU:
      • 5900X
      • Memory:
      • 32GB 3200MHz ECC
      • Storage:
      • 2TB Linux, 2TB Games (Win 10)
      • Graphics card(s):
      • Asus Strix RX Vega 56
      • PSU:
      • 650W Corsair TX
      • Case:
      • Antec 300
      • Operating System:
      • Fedora 39 + Win 10 Pro 64 (yuk)
      • Monitor(s):
      • Benq XL2730Z 1440p + Iiyama 27" 1440p
      • Internet:
      • Zen 900Mb/900Mb (CityFibre FttP)

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by watercooled View Post
    And, by 'binary translation', I wonder if they're just referring to uOps, which are pretty much ubiquitous in modern processors of this class?
    I thought only x86 CPUs used uOps, because they are the only ones trying to run a dog's dinner of an instruction set. The whole point of RISC is easy & hence direct decode.

  14. #3068
    Senior Member watercooled's Avatar
    Join Date
    Jan 2009
    Posts
    11,478
    Thanks
    1,541
    Thanked
    1,029 times in 872 posts

    Re: AMD - Piledriver chitchat

    AFAIK ARM (or at least some of the higher performance OoO cores) use uOps too, although most instructions are decoded to one uOp. I'll see if I can find some more details and post back.

    Edit: Found some reference at ARM and Anandtech:
    http://infocenter.arm.com/help/topic.../BABBGJHI.html
    http://www.anandtech.com/show/7126/t...e-cortex-a12/3

    I'm not exactly sure how they compare to x86 uOps though.

    Edit2: Although 'ubiquitous' was probably too strong a term, having read around a bit more. I think I've seen the reference to ARM uOps in the past and read to much into it.

    Also, I've just spotted this (unrelated) while scanning through some posts:
    Quote Originally Posted by DanceswithUnix View Post
    Good, though I still wonder just where AMD are headed these days.

    Now see those blocks marked "Decode" that convert amd64 instructions into internal uops, can they make that do ARM instructions I wonder...
    http://forums.hexus.net/cpus/241925-...ml#post2747673
    Interesting prediction.
    Last edited by watercooled; 13-08-2014 at 12:24 AM.

  15. Received thanks from:

    Noxvayl (17-08-2014)

  16. #3069
    root Member DanceswithUnix's Avatar
    Join Date
    Jan 2006
    Location
    In the middle of a core dump
    Posts
    13,010
    Thanks
    781
    Thanked
    1,568 times in 1,325 posts
    • DanceswithUnix's system
      • Motherboard:
      • Asus X470-PRO
      • CPU:
      • 5900X
      • Memory:
      • 32GB 3200MHz ECC
      • Storage:
      • 2TB Linux, 2TB Games (Win 10)
      • Graphics card(s):
      • Asus Strix RX Vega 56
      • PSU:
      • 650W Corsair TX
      • Case:
      • Antec 300
      • Operating System:
      • Fedora 39 + Win 10 Pro 64 (yuk)
      • Monitor(s):
      • Benq XL2730Z 1440p + Iiyama 27" 1440p
      • Internet:
      • Zen 900Mb/900Mb (CityFibre FttP)

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by watercooled View Post
    I'm not exactly sure how they compare to x86 uOps though.

    Edit2: Although 'ubiquitous' was probably too strong a term, having read around a bit more. I think I've seen the reference to ARM uOps in the past and read to much into it.
    We can only guess without a lot more information. Or possibly asking the question, there have been "ask ARM" events on the web in the past.

    But my guess is that they aren't that similar. ARM has some baggage from the 32 bit days, some must be quite irritating for them, and the CPU has modes for 32 bit and 64 bit so it can run more than one instruction set. I can't remember any specifics in ARM, but RISC cpus can have quite long instructions (like the Power architecture push multiple for procedure entry) that will internally be expanded up to lots of instructions like a macro expansion. However, there I think the similarities with x86 end.

    x86 is an architecture that no sane person would create today. It was rubbish when 8086 was new. In the face of such madness, translating the instruction stream into one suitable for multi issue makes sense.

    ARM V8 on the other hand looks far more sane. I would expect the uOps to be close to V8 native with some macro operations to improve code density where it makes sense. But then the devil is in the details with these things, perhaps there is a non obvious reason why translation would be more aggressive.

    Also, I've just spotted this (unrelated) while scanning through some posts:

    http://forums.hexus.net/cpus/241925-...ml#post2747673
    Interesting prediction.
    lol, well that just seems an obvious thing to fall out of AMD. They have their own problem to solve, two completely different architectures, and for AMD64 two different implementations with the cats and the big cores. The more they can share across the designs, the better use they can make of their limited engineering resources.

    Now, if they could merge the designs for the cat cores so that their tablet designs could run both AMD64 and ARM code that would be rather interesting. Not sure it would make commercial success as a product idea, but it reduces the number of designs they work on.

    Edit to add: Have read up some more of the latest info on Denver. Nvidia originally wanted to make an x86 chip, and I think it still shows. Static 7 way issue? The rule of thumb is that every 6th instruction is a jump, and I don't think anyone has cracked predicting two dependent jumps in a single cycle yet. Making use of more than 3 issue is hard even with out of order. So this just has to be a VLIW design. That, and the translation cache, and the way it seems to have software assist running at a level beneath the native OS, this really sounds like a new Transmeta design.
    Last edited by DanceswithUnix; 13-08-2014 at 10:20 AM.

  17. #3070
    Senior Member watercooled's Avatar
    Join Date
    Jan 2009
    Posts
    11,478
    Thanks
    1,541
    Thanked
    1,029 times in 872 posts

    Re: AMD - Piledriver chitchat

    According to this (not sure if they're referencing Nvidia or if it's their own take on it) http://hothardware.com/News/Nvidias-...a-Rides-Again/ they're claiming it's not VLIW?

    In terms of issue width, isn't that the reason ARM A7, Saltwell, Silvermont, Bobcat, Jaguar are all 2-issue for power efficiency? It's why I'm thinking the '3-issue vs 7-issue' comparison isn't really a fair one.

    Though, even with compiler-scheduled code on graphical workloads like on GPUs, AMD still saw it fit to drop from 5-wide to 4-wide VLIW to make better use of resources? I'm not sure if that's directly comparable, mind, but I would've thought it was easier to extract ILP from graphical workloads vs doing it dynamically on CPU workloads?

    The Denver scheduling hardware must be massive - they claim to have chosen to not use hardware OoO for efficiency/die size reasons, but Denver looks massive, assuming it's made using the same libraries, node.

  18. #3071
    root Member DanceswithUnix's Avatar
    Join Date
    Jan 2006
    Location
    In the middle of a core dump
    Posts
    13,010
    Thanks
    781
    Thanked
    1,568 times in 1,325 posts
    • DanceswithUnix's system
      • Motherboard:
      • Asus X470-PRO
      • CPU:
      • 5900X
      • Memory:
      • 32GB 3200MHz ECC
      • Storage:
      • 2TB Linux, 2TB Games (Win 10)
      • Graphics card(s):
      • Asus Strix RX Vega 56
      • PSU:
      • 650W Corsair TX
      • Case:
      • Antec 300
      • Operating System:
      • Fedora 39 + Win 10 Pro 64 (yuk)
      • Monitor(s):
      • Benq XL2730Z 1440p + Iiyama 27" 1440p
      • Internet:
      • Zen 900Mb/900Mb (CityFibre FttP)

    Re: AMD - Piledriver chitchat

    Quote Originally Posted by watercooled View Post
    According to this (not sure if they're referencing Nvidia or if it's their own take on it) http://hothardware.com/News/Nvidias-...a-Rides-Again/ they're claiming it's not VLIW?

    In terms of issue width, isn't that the reason ARM A7, Saltwell, Silvermont, Bobcat, Jaguar are all 2-issue for power efficiency? It's why I'm thinking the '3-issue vs 7-issue' comparison isn't really a fair one.

    Though, even with compiler-scheduled code on graphical workloads like on GPUs, AMD still saw it fit to drop from 5-wide to 4-wide VLIW to make better use of resources? I'm not sure if that's directly comparable, mind, but I would've thought it was easier to extract ILP from graphical workloads vs doing it dynamically on CPU workloads?

    The Denver scheduling hardware must be massive - they claim to have chosen to not use hardware OoO for efficiency/die size reasons, but Denver looks massive, assuming it's made using the same libraries, node.
    I think the point here is that the scheduling hardware is tiny. I suspect it isn't out of order because it isn't even doing dependency checks on the instruction stream. If the translation/optimization layer makes sure all is well then they shouldn't have to.

    So they can call it all they want, but issuing 7 ops in parallel in a static design must at the very least borrow very heavily from VLIW. I don't see that as a bad thing, they have enough successful previous in graphics architecture to know how to make that fly if anyone can.

    I don't think Intel/HP liked calling Itanium VLIW either. Intel were trying to make this work at the high end of the range which I think is way harder, trying to do it at the lower end might just work.

  19. #3072
    Senior Member watercooled's Avatar
    Join Date
    Jan 2009
    Posts
    11,478
    Thanks
    1,541
    Thanked
    1,029 times in 872 posts

    Re: AMD - Piledriver chitchat

    I'm not denying it's VLIW likeness BTW, just some articles are claiming so.

    Hmm, so if the scheduling etc is done in software, does that mean it literally runs on the cores? I was kind of interpreting the 'software' to be something running on independent microcontrollers, kind of like the dispatch processors we see on GPUs. But reading back through, it seems that's not the case?

Thread Information

Users Browsing this Thread

There are currently 38 users browsing this thread. (0 members and 38 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •