Question to the known devs in here about RT:
I'm curious, do you expect RT to play a big role in next-gen console games?
Or are we still likely to see baked lighting and more novel approximations be prevalent?
cc
Locuza,
40KAl,
Fafalada
I'm just a layman, not a dev.
But I think every answer from today's standpoint will be vague, even if you're asking a dev.
I expect rather limited integration of ray tracing and see the current Geforce RTX hardware as a usable reference point.
I can't imagine AMD launching Epyc 2 without AVX512, and since those would basically be Zen 2 modules, I'd imagine the design is able to support it; it would just be a matter of down-porting the feature.
If they do launch Epyc 2 without AVX512 then damn, wtf AMD?
Vector maths can massively speed up some common coding structures (i.e. loops), and AVX512 can handle really wide loops, so it would be a massive boost to code execution on the CPU. Besides that, there's a bunch of speculative uses for CPU vector maths, from pathfinding and AI to RT. It's less about new things (and I'm not well versed enough in this to imagine new things anyway) and more about making current things much, much faster (so you can imagine more of them, and better).
There is no AVX512 support in Zen 2.
AMD kept it simple and upgraded the pipes from 128-bit to 256-bit.
They use relatively lightweight cores and more of them.
For multithreaded applications you still get good vector performance.
I dunno why they've decided to call what is essentially half of a multiprocessor a CU, and to call the latter a Work Group Processor. But yeah, there are 20 WGPs in Navi 10, each comprised of two "CUs". One could argue that the WGP is actually the new CU in RDNA, as it mostly fits the same requirements a CU did in GCN.
[...] We don't know really. Looking at this you could argue that one RDNA SIMD is able to do what you're describing as a "CU", as each of them now has its own scheduler. The fact that two RDNA "CUs" share caches and the LDS means more than which unit is able to fetch/decode/etc. It's clear that AMD tried to fit the new h/w into GCN metrics to make comparisons easier, but I wonder if they've muddied the waters, so to speak, while doing so. Thus far what GCN had as a CU was basically the equivalent of NV's multiprocessor (SM, streaming multiprocessor); with RDNA this seems to change, with a CU being a pair of SIMDs, while four of them (same as in GCN, btw) comprise a WGP (note the P there, which stands for "processor" too) - which seems to suit NV's SM metrics better than RDNA's CU would.
RDNA has multiple working modes on different levels.
Wave32 or Wave64 is used depending on the workload; depending on the circumstances, one or the other is more efficient.
On a higher level an RDNA Compute Unit can still be compared to one GCN Compute Unit, because under RDNA a Compute Unit can still work independently.
Every Compute Unit can work on a workgroup on its own, which is also called CU mode, where the registers and caches are all exclusive per Compute Unit.
But it's also possible to have two Compute Units working on a workgroup together and sharing resources; this is called WGP (Work Group Processor) mode.