Commit 5254568

fix(linux): Fix headers and whitespace in AM62X performance rst

The performance guides use an incorrect header hierarchy. Fix the header hierarchy and whitespace to produce HTML that is easier to read and parse.

Signed-off-by: Judith Mendez <jm@ti.com>
1 parent 6c3f7fc commit 5254568
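
For orientation, the heading levels this commit appears to settle on can be read off the added lines in the diff: `#` with overline for the document title, `*` with overline for top-level sections, then `=`, `-`, and `^` for successively deeper levels. The sketch below uses placeholder titles that are not taken from the guide, and only illustrates that ordering, which matches the adornment convention suggested in the Sphinx documentation:

```rst
##############
Document Title
##############

*************
Major Section
*************

Section
=======

Subsection
----------

Sub-subsection
^^^^^^^^^^^^^^
```

reStructuredText does not assign fixed levels to adornment characters. The level of each style is determined by the order in which it first appears in the file, so using one order consistently is what keeps the generated HTML outline predictable.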

2 files changed

Lines changed: 93 additions & 75 deletions

source/devices/AM62X/linux/Linux_Performance_Guide.rst

Lines changed: 78 additions & 65 deletions
Original file line number | Diff line number | Diff line change
@@ -1,10 +1,11 @@
11

2-
====================================
2+
#################################
33
Linux 12.00.00 Performance Guide
4-
====================================
4+
#################################
55

6-
.. rubric:: **Read This First**
7-
:name: read-this-first-kernel-perf-guide
6+
***************
7+
Read This First
8+
***************
89

910
**All performance numbers provided in this document are gathered using
1011
following Evaluation Modules unless otherwise specified.**
@@ -21,25 +22,30 @@ following Evaluation Modules unless otherwise specified.**
2122

2223
Table: Evaluation Modules
2324

24-
.. rubric:: About This Manual
25-
:name: about-this-manual-kernel-perf-guide
25+
*****************
26+
About This Manual
27+
*****************
2628

2729
This document provides performance data for each of the device drivers
2830
which are part of the Processor SDK Linux package. This document should be
2931
used in conjunction with release notes and user guides provided with the
3032
Processor SDK Linux package for information on specific issues present
3133
with drivers included in a particular release.
3234

33-
.. rubric:: If You Need Assistance
34-
:name: if-you-need-assistance-kernel-perf-guide
35-
3635
For further information or to report any problems, contact
3736
https://e2e.ti.com/ or https://support.ti.com/
3837

38+
|
39+
40+
*****************
3941
System Benchmarks
40-
-----------------
42+
*****************
43+
44+
|
45+
4146
LMBench
42-
^^^^^^^
47+
=======
48+
4349
LMBench is a collection of microbenchmarks of which the memory bandwidth
4450
and latency related ones are typically used to estimate processor
4551
memory system performance. More information about lmbench at
@@ -183,7 +189,8 @@ Execute the LMBench with the following:
183189
"tcp_latency_using_localhost (microsec)","1.00 (min 0.85, max 1.14)","0.89 (min 0.76, max 1.02)","0.76"
184190

185191
Dhrystone
186-
^^^^^^^^^
192+
=========
193+
187194
Dhrystone is a core only benchmark that runs from warm L1 caches in all
188195
modern processors. It scales linearly with clock speed.
189196

@@ -205,7 +212,8 @@ Execute the benchmark with the following:
205212
"dhrystone_per_second (dhrystonep)","6027183.60 (min 5882353.00, max 6250000.00)","6789289.58 (min 6451613.00, max 7142857.00)","6819923.17 (min 6666666.50, max 6896551.50)"
206213

207214
Whetstone
208-
^^^^^^^^^
215+
=========
216+
209217
Whetstone is a benchmark primarily measuring floating-point arithmetic performance.
210218

211219
Execute the benchmark with the following:
@@ -220,7 +228,8 @@ Execute the benchmark with the following:
220228
"whetstone (mips)","4444.43 (min 3333.30, max 5000.00)","5000.00","5000.00"
221229

222230
Linpack
223-
^^^^^^^
231+
=======
232+
224233
Linpack measures peak double precision (64 bit) floating point performance in
225234
solving a dense linear system.
226235

@@ -230,7 +239,8 @@ solving a dense linear system.
230239
"linpack (kflops)","515140.50 (min 508416.00, max 518513.00)","581699.00 (min 581477.00, max 581921.00)","578855.50 (min 578148.00, max 579563.00)"
231240

232241
NBench
233-
^^^^^^
242+
======
243+
234244
NBench which stands for Native Benchmark is used to measure macro benchmarks
235245
for commonly used operations such as sorting and analysis algorithms.
236246
More information about NBench at
@@ -251,7 +261,8 @@ https://nbench.io/articles/index.html
251261
"string_sort (iterations)","150.20 (min 150.14, max 150.27)","168.20 (min 168.14, max 168.29)","168.19 (min 168.17, max 168.21)"
252262

253263
Stream
254-
^^^^^^
264+
======
265+
255266
STREAM is a microbenchmark for measuring data memory system performance without
256267
any data reuse. It is designed to miss on caches and exercise data prefetcher
257268
and speculative accesses.
@@ -277,7 +288,8 @@ Execute the benchmark with the following:
277288
"triad (mb/s)","1349.88 (min 1303.10, max 1385.40)","1615.13 (min 1494.30, max 1856.90)","1667.70"
278289

279290
CoreMarkPro
280-
^^^^^^^^^^^
291+
===========
292+
281293
CoreMark®-Pro is a comprehensive, advanced processor benchmark that works with
282294
and enhances the market-proven industry-standard EEMBC CoreMark® benchmark.
283295
While CoreMark stresses the CPU pipeline, CoreMark-Pro tests the entire processor,
@@ -313,7 +325,8 @@ and floating-point workloads, and data sets for utilizing larger memory subsyste
313325
"zip-test (workloads/)","35.29 (min 33.90, max 36.36)","40.31 (min 38.46, max 42.55)","40.60 (min 38.46, max 41.67)"
314326

315327
MultiBench
316-
^^^^^^^^^^
328+
==========
329+
317330
MultiBench™ is a suite of benchmarks that allows processor and system designers to
318331
analyze, test, and improve multicore processors. It uses three forms of concurrency:
319332
Data decomposition: multiple threads cooperating on achieving a unified goal and
@@ -361,11 +374,13 @@ thread-enabled workloads to be tested.
361374
"x264-4mq (workloads/)","0.49 (min 0.48, max 0.50)","0.56 (min 0.55, max 0.57)","0.56 (min 0.56, max 0.57)"
362375
"x264-4mqw1 (workloads/)","0.49 (min 0.49, max 0.50)","0.56 (min 0.54, max 0.57)","0.56"
363376

377+
|
378+
364379
Boot-time Measurement
365-
---------------------
380+
=====================
366381

367382
Boot media: MMCSD
368-
^^^^^^^^^^^^^^^^^
383+
-----------------
369384

370385
.. csv-table:: Linux boot time MMCSD
371386
:header: "Boot Configuration","am62xx_lp_sk-fs: Boot time in seconds: avg(min,max)","am62xx_sk-fs: Boot time in seconds: avg(min,max)","am62xxsip_sk-fs: Boot time in seconds: avg(min,max)"
@@ -376,8 +391,8 @@ Boot time numbers [avg, min, max] are measured from "Starting kernel" to Linux p
376391

377392
|
378393
379-
ALSA SoC Audio Driver
380-
---------------------
394+
ALSA SoC Audio
395+
==============
381396

382397
#. Access type - RW\_INTERLEAVED
383398
#. Channels - 2
@@ -412,11 +427,12 @@ ALSA SoC Audio Driver
412427

413428
|
414429
415-
Graphics SGX/RGX Driver
416-
-----------------------
430+
Graphics SGX/RGX
431+
================
417432

418433
GFXBench
419-
^^^^^^^^
434+
--------
435+
420436
Run GFXBench and capture performance reported (Score and Display rate in fps). All display outputs (HDMI, Displayport and/or LCD) are connected when running these tests
421437

422438
.. csv-table:: GFXBench Performance
@@ -427,7 +443,7 @@ Run GFXBench and capture performance reported (Score and Display rate in fps). A
427443
" GFXBench 5.x gl_5_high_off","11.08 (min 10.87, max 11.19)","0.17","11.79 (min 11.60, max 11.89)","0.18"
428444

429445
Glmark2
430-
^^^^^^^
446+
-------
431447

432448
Run Glmark2 and capture performance reported (Score). All display outputs (HDMI, Displayport and/or LCD) are connected when running these tests
433449

@@ -441,7 +457,8 @@ Run Glmark2 and capture performance reported (Score). All display outputs (HDMI,
441457
|
442458
443459
Ethernet
444-
-----------------
460+
========
461+
445462
Ethernet performance benchmarks were measured using :command:`netperf` 2.7.1 https://hewlettpackard.github.io/netperf/doc/netperf.html
446463
Test procedures were modeled after those defined in RFC-2544:
447464
https://tools.ietf.org/html/rfc2544, where the DUT is the TI device
@@ -507,29 +524,29 @@ Running the following commands will trigger :command:`netperf` clients to measur
507524
netperf -H <DUT ip> -j -C -l 60 -t UDP_STREAM -b <burst_size> -w <wait_time> -- -m <UDP datagram size>
508525
-k DIRECTION,THROUGHPUT,MEAN_LATENCY,LOCAL_CPU_UTIL,REMOTE_CPU_UTIL,LOCAL_BYTES_SENT,REMOTE_BYTES_RECVD,LOCAL_SEND_SIZE
509526
510-
CPSW/CPSW2g/CPSW3g Ethernet Driver
511-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
527+
CPSW/CPSW2g/CPSW3g Ethernet
528+
---------------------------
512529

513530
- CPSW3g: AM62x
514531

515-
.. rubric:: TCP Bidirectional Throughput
516-
:name: CPSW2g-tcp-bidirectional-throughput
532+
TCP Bidirectional Throughput
533+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
517534

518535
.. csv-table:: CPSW2g TCP Bidirectional Throughput
519536
:header: "Command Used","am62xx_lp_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_lp_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xx_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xxsip_sk-fs: THROUGHPUT (Mbits/sec)","am62xxsip_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
520537

521538
"netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_STREAM; netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_MAERTS","1606.03 (min 1303.43, max 1816.77)","60.01 (min 39.33, max 71.84)","1540.48 (min 1283.30, max 1773.53)","52.49 (min 40.80, max 68.45)","1673.91 (min 1656.34, max 1691.48)","64.30 (min 62.64, max 65.95)"
522539

523-
.. rubric:: TCP Bidirectional Throughput Interrupt Pacing
524-
:name: CPSW2g-tcp-bidirectional-throughput-interrupt-pacing
540+
TCP Bidirectional Throughput Interrupt Pacing
541+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
525542

526543
.. csv-table:: CPSW2g TCP Bidirectional Throughput Interrupt Pacing
527544
:header: "Command Used","am62xx_lp_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_lp_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xx_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xxsip_sk-fs: THROUGHPUT (Mbits/sec)","am62xxsip_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
528545

529546
"netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_STREAM; netperf -H 192.168.0.1 -j -c -C -l 60 -t TCP_MAERTS","1641.49 (min 1549.80, max 1756.23)","51.62 (min 39.44, max 63.10)","1563.91 (min 1444.87, max 1707.72)","39.79 (min 33.68, max 47.59)","1731.44 (min 1694.58, max 1768.29)","50.23 (min 42.61, max 57.85)"
530547

531-
.. rubric:: UDP Throughput
532-
:name: CPSW2g-udp-throughput-0-loss
548+
UDP Throughput
549+
^^^^^^^^^^^^^^
533550

534551
.. csv-table:: CPSW2g UDP Egress Throughput 0 loss
535552
:header: "Frame Size(bytes)","am62xx_lp_sk-fs: UDP Datagram Size(bytes) (LOCAL_SEND_SIZE)","am62xx_lp_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_lp_sk-fs: Packets Per Second (kPPS)","am62xx_lp_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xx_sk-fs: UDP Datagram Size(bytes) (LOCAL_SEND_SIZE)","am62xx_sk-fs: THROUGHPUT (Mbits/sec)","am62xx_sk-fs: Packets Per Second (kPPS)","am62xx_sk-fs: CPU Load % (LOCAL_CPU_UTIL)","am62xxsip_sk-fs: UDP Datagram Size(bytes) (LOCAL_SEND_SIZE)","am62xxsip_sk-fs: THROUGHPUT (Mbits/sec)","am62xxsip_sk-fs: Packets Per Second (kPPS)","am62xxsip_sk-fs: CPU Load % (LOCAL_CPU_UTIL)"
@@ -560,38 +577,32 @@ CPSW/CPSW2g/CPSW3g Ethernet Driver
560577

561578
|
562579
563-
Linux OSPI Flash Driver
564-
-----------------------
565-
566-
.. rubric:: AM62XX-SK
567-
:name: am62xx-sk-ospi-flash-driver
580+
OSPI
581+
====
568582

569-
.. rubric:: RAW
570-
:name: am62xx-sk-ospi-raw
583+
OSPI RAW
584+
--------
571585

572586
.. csv-table:: OSPI Raw Flash Driver
573587
:header: "File size (Mbytes)","am62xx_sk-fs: Raw Read Throughput (Mbytes/sec)"
574588

575589
"50","151.67 (min 21.01, max 192.31)"
576590

577-
.. rubric:: RAW
578-
:name: am62xx-sk-lp-ospi-raw
579-
580591
.. csv-table:: OSPI Raw Flash Driver
581592
:header: "File size (Mbytes)","am62xx_lp_sk-fs: Raw Read Throughput (Mbytes/sec)"
582593

583594
"50","30.49 (min 29.94, max 31.05)"
584595

585-
.. rubric:: RAW
586-
:name: am62xxsip-sk-ospi-raw
587-
588596
.. csv-table:: OSPI Raw Flash Driver
589597
:header: "File size (Mbytes)","am62xxsip_sk-fs: Raw Read Throughput (Mbytes/sec)"
590598

591599
"50","108.69"
592600

593-
EMMC Driver
594-
-----------
601+
|
602+
603+
EMMC
604+
====
605+
595606
.. warning::
596607

597608
**IMPORTANT**: The performance numbers can be severely affected if the media is
@@ -601,7 +612,7 @@ EMMC Driver
601612
re-mount in async mode.
602613

603614
EMMC EXT4 FIO 1G
604-
^^^^^^^^^^^^^^^^
615+
----------------
605616

606617
.. csv-table:: EMMC EXT4 FIO 1G
607618
:header: "Buffer size (bytes)","am62xx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Write EXT4 CPU Load (%)","am62xx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Read EXT4 CPU Load (%)"
@@ -628,7 +639,7 @@ EMMC EXT4 FIO 1G
628639
"256k","91.67 (min 91.00, max 92.50)","2.43 (min 2.08, max 2.63)","170.33 (min 170.00, max 171.00)","2.63 (min 2.35, max 2.79)"
629640

630641
EMMC EXT4
631-
^^^^^^^^^
642+
---------
632643

633644
.. csv-table:: EMMC EXT4
634645
:header: "Buffer size (bytes)","am62xx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Write EXT4 CPU Load (%)","am62xx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Read EXT4 CPU Load (%)"
@@ -658,7 +669,7 @@ EMMC EXT4
658669
"5242880","73.34 (min 11.84, max 97.06)","13.62 (min 3.25, max 24.94)","179.63 (min 178.91, max 180.27)","22.32 (min 19.21, max 26.81)"
659670

660671
EMMC VFAT
661-
^^^^^^^^^
672+
---------
662673

663674
.. csv-table:: EMMC VFAT
664675
:header: "Buffer size (bytes)","am62xx_sk-fs: Write VFAT Throughput (Mbytes/sec)","am62xx_sk-fs: Write VFAT CPU Load (%)","am62xx_sk-fs: Read VFAT Throughput (Mbytes/sec)","am62xx_sk-fs: Read VFAT CPU Load (%)"
@@ -687,8 +698,8 @@ EMMC VFAT
687698
"1048576","79.09 (min 59.19, max 96.71)","7.86 (min 3.77, max 12.68)","174.88 (min 174.20, max 175.33)","13.16 (min 11.76, max 15.48)"
688699
"5242880","78.00 (min 58.68, max 98.05)","8.31 (min 4.45, max 13.28)","174.56 (min 174.33, max 174.70)","13.21 (min 12.97, max 13.39)"
689700

690-
UBoot EMMC Driver
691-
-----------------
701+
UBoot EMMC
702+
----------
692703

693704

694705
.. csv-table:: UBOOT EMMC RAW
@@ -710,7 +721,7 @@ UBoot EMMC Driver
710721
"4000000","98611.89 (min 97234.42, max 99902.44)","175229.95"
711722

712723
MMCSD
713-
-----
724+
=====
714725

715726
.. warning::
716727

@@ -721,7 +732,7 @@ MMCSD
721732
re-mount in async mode.
722733

723734
MMC EXT4 FIO 1G
724-
^^^^^^^^^^^^^^^
735+
---------------
725736

726737
.. csv-table:: MMC EXT4 FIO 1G
727738
:header: "Buffer size (bytes)","am62xx_sk-fs: Write EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Write EXT4 CPU Load (%)","am62xx_sk-fs: Read EXT4 Throughput (Mbytes/sec)","am62xx_sk-fs: Read EXT4 CPU Load (%)"
@@ -748,7 +759,7 @@ MMC EXT4 FIO 1G
748759
"256k","32.00 (min 18.90, max 39.70)","1.36 (min 1.21, max 1.47)","83.60 (min 83.20, max 83.90)","1.58 (min 1.40, max 1.80)"
749760

750761
MMC EXT4
751-
^^^^^^^^
762+
--------
752763

753764
.. csv-table:: MMC EXT4
754765
:header: "Buffer size (bytes)","am62xx_sk-fs: Write Raw Throughput (Mbytes/sec)","am62xx_sk-fs: Write Raw CPU Load (%)","am62xx_sk-fs: Read Raw Throughput (Mbytes/sec)","am62xx_sk-fs: Read Raw CPU Load (%)"
@@ -816,11 +827,11 @@ The performance numbers were captured using the following:
816827

817828
|
818829
819-
USB Driver
820-
----------
830+
USB
831+
===
821832

822833
USB Device Controller
823-
^^^^^^^^^^^^^^^^^^^^^
834+
---------------------
824835

825836
.. csv-table:: USBDEVICE HIGHSPEED SLAVE_READ_THROUGHPUT
826837
:header: "Number of Blocks","am62xx_lp_sk-fs: Throughput (MB/sec)","am62xx_sk-fs: Throughput (MB/sec)","am62xxsip_sk-fs: Throughput (MB/sec)"
@@ -834,11 +845,11 @@ USB Device Controller
834845

835846
|
836847
837-
CRYPTO Driver
838-
-------------
848+
CRYPTO
849+
======
839850

840851
OpenSSL Performance
841-
^^^^^^^^^^^^^^^^^^^
852+
-------------------
842853

843854
.. csv-table:: OpenSSL Performance
844855
:header: "Algorithm","Buffer Size (in bytes)","am62xx_lp_sk-fs: throughput (KBytes/Sec)","am62xx_sk-fs: throughput (KBytes/Sec)","am62xxsip_sk-fs: throughput (KBytes/Sec)"
@@ -913,11 +924,13 @@ Listed for each algorithm are the code snippets used to run each
913924
time -v openssl speed -elapsed -evp aes-128-cbc
914925

915926
IPSec Software Performance
916-
^^^^^^^^^^^^^^^^^^^^^^^^^^
927+
--------------------------
917928

918929
.. csv-table:: IPSec Software Performance
919930
:header: "Algorithm","am62xx_sk-fs: Throughput (Mbps)","am62xx_sk-fs: Packets/Sec","am62xx_sk-fs: CPU Load","am62xxsip_sk-fs: Throughput (Mbps)","am62xxsip_sk-fs: Packets/Sec","am62xxsip_sk-fs: CPU Load"
920931

921932
"aes128","77.65 (min 73.60, max 81.70)","6.50 (min 6.00, max 7.00)","51.08 (min 50.78, max 51.38)","80.50","7.00","51.43"
922933
"aes192","0.60","0.00","56.22"
923934
"aes256","130.00 (min 53.40, max 206.60)","11.00 (min 4.00, max 18.00)","39.11 (min 27.23, max 50.98)","88.60 (min 0.40, max 205.40)","7.67 (min 0.00, max 18.00)","43.07 (min 27.21, max 51.38)"
935+
936+
|
