Around a few weeks before the launch of the Xiaomi Mi Note Pro, I was digging through kernel source to understand Snapdragon 810 and some other Qualcomm SoCs. We had kept track of MSM8994v1 and MSM8994v2, which were relatively well understood in terms of what they represented. V1 was an engineering sample variant, which was discussed as early as September in 2014. This variant set both the A57 and A53 clusters to a ~1 GHz maximum. V2 was the first mass production revision, which was present in both the LG G Flex 2, and HTC One M9. However, shortly before the launch of the Mi Note Pro, we noted references to v2.1 in various kernel source trees.

Immediately diving into the issue, what I found was that there wasn’t much of note about v2.1 other than a change to the chip ID and other identifying information, so any formal information on the changes between these revisions is scarce. Meanwhile phones that definitely didn’t have this variant of the SoC still had references to it in kernel source, which only made it all the harder to even identify v2.1 devices in the first place. Ultimately the only way to truly verify whether a device has a certain revision of an SoC is to either see whether the file is included in the compilation process or to verify the revision by checking the device itself.

Our big break came with the release of the Xiaomi Mi Note Pro, which was among the first (if not the first) devices to incorporate a v2.1 SoC. After getting our hands on a sample of the Mi Note Pro, we were finally able to start piecing together everything in order to really understand what was going on with this new variant.

The first and most obvious change in v2.1 is a mild increase in GPU clocks from 600 to 630 MHz. This represents around a 5% increase in performance, which helps to close the gap with the 772-700 MHz Mali T760MP8 graphics processor used in the Exynos 7420 of the Samsung Galaxy S6. This is fairly obvious to anyone that runs a graphics benchmark on this new variant, although this is a best-case improvement as inefficiencies elsewhere can erode the difference.

The other differences are far more subtle. The first is that there’s noticeably less throttling on the A57 cluster compared to Snapdragon 810 v2. However even with that change - and unlike the Snapdragon 808 and competing SoCs - both variants of the Snapdragon 810 still see the unfortunate characteristic of ultimately forcing all threads off of the A57 cluster to stay within TDP limits in high load conditions, such as when running Basemark OS II’s battery test.

The second difference requires testing peak CPU bandwidth and latency between L1, L2, and main memory, which required digging into our in-house benchmark tools. In the interest of seeing 810's maximum memory bandwidth, we put together a memory multithreaded memory benchmark that runs on multiple CPU cores, and combines read and write bandwidth to try and get as close as possible to maximum bandwidth available. The figures that we present here will represent a scenario where two threads, one reading and one writing will load the memory system simultaneously. This is done to try to saturate the main memory controller and to circumvent some of the quirks of ARM's bus architecture, which has dedicated read and write ports, leading to only half the total possible bandwidth in each direction.

We’ve also tested against multiple other applications such as Rahul Garg’s RgBandwidth and RamBench to verify these observations. We also checked with instrumented runs that the CPU, memory controller, and CCI interconnect were all running at appropriate frequencies so these results should be largely unaffected by the governor due to the 100% duty cycle of the load during the test.

The graphs above are interesting, but they don’t really illustrate the relative difference between v2 and v2.1. To better show this, the graphs below represent the percent difference between the HTC One M9 and Xiaomi Mi Note Pro in this test, where positive values are an advantage in favor of Mi Note Pro, while negative values represent an advantage in favor of the One M9.

From the calculations, memory latency improves on an average of 15%, but this is mostly concentrated in the L1/L2 cache portion of the test, while the differences in DRAM are relatively small. If we change the scaling to show less of main memory, we can see that most of the variance is present in that portion of the test. Given that we’re looking at time on the order of nanoseconds, I wouldn’t think too hard about these differences either. The only interesting thing to note in the latency test is that the “memory mountain” model is still valid, where we can expect about an order of magnitude increase in latency for each level of cache that we miss, where L1 appears to take about 5 cycles at 2 GHz, then around 15 cycles for L2, then around 600-700 cycles for main memory. (To put that into comparison for PC CPUs, L3 cache usually hits up to 70 cycles and up to 150 cycles for Intel's CrystalWell eDRAM before the jump out to main memory.)

The memory bandwidth values are really much more interesting though, as there’s a consistent improvement in bandwidth across the board. There is some interesting behavior around L1 cache, but this is likely due to the previously mentioned ramp time for frequency scaling. If we just look beyond L2, into the main memory, the result is that there’s around a 33% increase in bandwidth, and a 38% average increase if we look at the entire sweep through memory.

Overall, for a 0.1 revision, Snapdragon 810 v2.1 is a pretty significant change where memory is concerned. We also see some improvements in terms of thermal management, but we yet have to fully characterize just how much this improvement is and if it's enough to be able to compete with Samsung's 14nm silicon in the Galaxy S6.

Meanwhile there were rumors floating around of a new revision due to show up in the summer, and it appears that this is the rumored revision. This leads me to question whether it made sense to release devices with the v2 revision, especially if future revisions would have resolved some notable issues and dramatically improved SoC performance. I also question whether OEMs would silently introduce these SoCs into later production runs of products that first shipped with v2, which is yet another can of worms.

Putting these questions aside, this new revision brings much-needed improvements to the Snapdragon 810, and should go a long way as a stop-gap until Snapdragon 820 begins shipping. We'll see how the change from v2 to v2.1 affects performance on a higher level in our Xiaomi Mi Note Pro review coming shortly.

Comments Locked

84 Comments

View All Comments

  • Freaky_Angelus - Friday, June 19, 2015 - link

    Sadly no, you wake the entire cluster and you can play with frequency and activity. So the entire cluster is awake and you use 1-4 of their cores.

    This is were the 808 is easier on thermal load. 4x A53 and 2x A57, so reduced thermal load by just saying "we don't need 4x A57". Not a bad decision at all, unless you're buying a phone for gaming, you wouldn't need so much power that often.
  • grayson_carr - Wednesday, June 24, 2015 - link

    I love my G4, and I'm glad it only has 2x A57 cores, but damn do I wish it had the same Adreno 430 as the 810. That would be the perfect combo IMO. Why in hell did they go with the 418?
  • testbug00 - Friday, June 19, 2015 - link

    or, maybe a phone only needs one fast core if you're going to do big.LITTLE. Better yet, having cores that are low power and high performance or low performance with high scalability are the best designs.

    Apple's custom cores and Krait are examples of both.
  • mortimerr - Friday, June 19, 2015 - link

    Anyone know when the 820 is supposed to be making its way into new devices?
    Because if the 810 is going to go through constant revisions to make it any good then I'd rather just hold out for the 820. On top of that, awhile down the line could see Qualcomm finally lose their hold over SoCs in Android devices after what we've seen from Asus.
  • Taristin - Friday, June 19, 2015 - link

    So where can someone in the US even get a Mi Note Pro, assuming we were masochistic enough to want one? (Ive read that the phone could only use 3G/HSPA networks on T-Mobile in the US, but that's fast enough for most tasks.)
  • tygrus - Saturday, June 20, 2015 - link

    The 810 was 20nm so a shrink to 14nm should reduce the power load at the same clockspeed. It would be nice to be able to profile the apps and phone mode to set performance limits. Some games use high CPU & GPU even while doing very liitle so I would love to tell it 20fps (75% MHz of A53 and 50% GPU) instead of 60+ fps when I don't need to aim and shoot. Less CPU for background apps.
  • serendip - Saturday, June 20, 2015 - link

    Seriously? I was going to get a Mi Note Pro but I held back because of the battery life and overheating issues.

    How long does your battery last? I'm a typical business user, mainly emails and calls, with Web browsing and some music listening after work. I'm interested in the Note mainly for the screen, I hope it can be easily read outdoors.
  • Meshh - Monday, June 22, 2015 - link

    For my needs, it's quite good : i'm usually around 30/40% near midnight, with 2H+ watching videos, light web browsing, some texting + mails, and (not so much) music listening.

    I didnt compare with the classic Note (probably less draining) display, but the screen is gorgeous. The sunlight display features (https://www.youtube.com/watch?v=jZkIZWo9yHc) sure helps but since the last days were cloudy here, i couldnt really test the feature.
  • Martin84a - Saturday, June 20, 2015 - link

    http://www.afterdawn.com/news/article.cfm/2015/03/...

    Enough said.
  • Freaky_Angelus - Tuesday, June 23, 2015 - link

    LOL, you're referencing a site that uses a Dutch site (tweakers.net) who found that in a single benchmark the HTC throttling was uhm... let's call it absent. That's not an heat problem, that's an heat management problem. The irony of Tweakers being referenced here when I reference Anandtech there, that made my morning!

    Now I won't blame you for not reading Dutch, but as I can... That said has revised it's results due to the update. Which was why Anandtech held off with Part 2 of the review for about a week.

Log in

Don't have an account? Sign up now