Jetson Xavier の変更点

追加された行はこの色です。
削除された行はこの色です。
Jetson Xavier へ行く。
[[Jetson TX2]]

#contents


|SoC|Xavier|
|CPU|Carmel 8 core|
|GPU|Volta 512 Core|
|Memory|16GB 256Bit LPDDR4|
|L1 d-cache|64K|
|L1 i-cache|128K|
|L2 cache|2048K|
|L3 cache|4096K|
|Storage|32GB eMMC|

-https://www.jetsonhacks.com/2018/10/07/nvpmodel-nvidia-jetson-agx-xavier-developer-kit/
* スペック [#k8abbc96]
-CPU:Carmel 8 cores
-cpuinfo
 $ cat /proc/cpuinfo
 processor       : 0
 model name      : ARMv8 Processor rev 0 (v8l)
 BogoMIPS        : 62.50
 Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer : 0x4e
 CPU architecture: 8
 CPU variant     : 0x0
 CPU part        : 0x004
 CPU revision    : 0
 MTS version     : 43306594
 
 processor       : 1
 model name      : ARMv8 Processor rev 0 (v8l)
 BogoMIPS        : 62.50
 Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer : 0x4e
 CPU architecture: 8
 CPU variant     : 0x0
 CPU part        : 0x004
 CPU revision    : 0
 MTS version     : 43306594
 
 processor       : 2
 model name      : ARMv8 Processor rev 0 (v8l)
 BogoMIPS        : 62.50
 Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer : 0x4e
 CPU architecture: 8
 CPU variant     : 0x0
 CPU part        : 0x004
 CPU revision    : 0
 MTS version     : 43306594
 
 processor       : 3
 model name      : ARMv8 Processor rev 0 (v8l)
 BogoMIPS        : 62.50
 Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer : 0x4e
 CPU architecture: 8
 CPU variant     : 0x0
 CPU part        : 0x004
 CPU revision    : 0
 MTS version     : 43306594
-出荷時の状態では4コアしか起動していない
-nvpmodelを0にして全力で回せるようにする
 sudo nvpmodel -m 0
-8コア現れる
 $ cat /proc/cpuinfo 
 processor	: 0
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 1
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 2
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 3
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 4
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 5
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 6
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
 
 processor	: 7
 model name	: ARMv8 Processor rev 0 (v8l)
 BogoMIPS	: 62.50
 Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
 CPU implementer	: 0x4e
 CPU architecture: 8
 CPU variant	: 0x0
 CPU part	: 0x004
 CPU revision	: 0
 MTS version	: 43306594
-fphp と asimdhp がArmv8.2の証
-lscpu
 $ lscpu
 Architecture:         aarch64
 Byte Order:           Little Endian
 CPU(s):               8
 On-line CPU(s) list:  0-3
 Off-line CPU(s) list: 4-7
 Thread(s) per core:   1
 Core(s) per socket:   2
 Socket(s):            2
 Vendor ID:            Nvidia
 Model:                0
 Model name:           ARMv8 Processor rev 0 (v8l)
 Stepping:             0x0
 CPU max MHz:          2265.6001
 CPU min MHz:          115.2000
 BogoMIPS:             62.50
 L1d cache:            64K
 L1i cache:            128K
 L2 cache:             2048K
 L3 cache:             4096K
 Flags:                fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
-cpufreq
 $ cat /sys/bus/cpu/devices/cpu?/cpufreq/cpuinfo_max_freq
 2265600
 2265600
 2265600
 2265600
 2265600
 2265600
 2265600
 2265600
-kernel
 $ uname -a
 Linux jetson-0423618001106 4.9.108-tegra #1 SMP PREEMPT Wed Oct 31 15:17:21 PDT 2018 aarch64 aarch64 aarch64 GNU/Linux
-OS
 $ lsb_release -a
 No LSB modules are available.
 Distributor ID:	Ubuntu
 Description:	Ubuntu 18.04.1 LTS
 Release:	18.04
 Codename:	bionic
-gcc
 $ gcc --version
 gcc (Ubuntu/Linaro 7.3.0-27ubuntu1~18.04) 7.3.0
 Copyright (C) 2017 Free Software Foundation, Inc.
 This is free software; see the source for copying conditions.  There is NO
 warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
-nvcc
 $ nvcc --version
 nvcc: NVIDIA (R) Cuda compiler driver
 Copyright (c) 2005-2018 NVIDIA Corporation
 Built on Sun_Aug_12_21:08:25_CDT_2018
 Cuda compilation tools, release 10.0, V10.0.117
-deviceQuery
 $ /usr/local/cuda-10.0/samples/1_Utilities/deviceQuery/deviceQuery
 /usr/local/cuda-10.0/samples/1_Utilities/deviceQuery/deviceQuery Starting...
 
  CUDA Device Query (Runtime API) version (CUDART static linking)
 
 Detected 1 CUDA Capable device(s)
 
 Device 0: "Xavier"
   CUDA Driver Version / Runtime Version          10.0 / 10.0
   CUDA Capability Major/Minor version number:    7.2
   Total amount of global memory:                 15819 MBytes (16587653120 bytes)
   ( 8) Multiprocessors, ( 64) CUDA Cores/MP:     512 CUDA Cores
   GPU Max Clock rate:                            1500 MHz (1.50 GHz)
   Memory Clock rate:                             1500 Mhz
   Memory Bus Width:                              256-bit
   L2 Cache Size:                                 524288 bytes
   Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
   Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers
   Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers
   Total amount of constant memory:               65536 bytes
   Total amount of shared memory per block:       49152 bytes
   Total number of registers available per block: 65536
   Warp size:                                     32
   Maximum number of threads per multiprocessor:  2048
   Maximum number of threads per block:           1024
   Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
   Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
   Maximum memory pitch:                          2147483647 bytes
   Texture alignment:                             512 bytes
   Concurrent copy and kernel execution:          Yes with 1 copy engine(s)
   Run time limit on kernels:                     No
   Integrated GPU sharing Host Memory:            Yes
   Support host page-locked memory mapping:       Yes
   Alignment requirement for Surfaces:            Yes
   Device has ECC support:                        Disabled
   Device supports Unified Addressing (UVA):      Yes
   Device supports Compute Preemption:            Yes
   Supports Cooperative Kernel Launch:            Yes
   Supports MultiDevice Co-op Kernel Launch:      Yes
   Device PCI Domain ID / Bus ID / location ID:   0 / 0 / 0
   Compute Mode:
      < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
 
 deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.0, CUDA Runtime Version = 10.0, NumDevs = 1
 Result = PASS


* Jetpack [#q62a11a3]
-[[Download Jetpack 4.1>https://developer.nvidia.com/embedded-computing]]
* All ON [#jde078b2]
-nvpmodel 0 is maxN
 !$ sudo crontab -l | tail -n 2
 @reboot nvpmodel -m 0
 @reboot echo 255 > /sys/kernel/debug/tegra_fan/target_pwm
*インストール [#zefec371]
-cmake以外はだいたい入ってた
 sudo apt-get -y install cmake ccache

*perf [#u76c98fe]
-[[Installing Linux performance monitoring tools on Jetson TX2 - NVIDIA Developer Forums>https://devtalk.nvidia.com/default/topic/1035531/jetson-tx2/installing-linux-performance-monitoring-tools-on-jetson-tx2/]]
-perf はカーネルとぴったり合ったバージョンのlinux-toolsを入れる必要がある
-が、Jetsonのkernelはカスタマイズ版だしなぁ、と思っていたら、ピッタリの項目があった。
-/usr/src/linux-headers-4.9.108-tegra/kernel-4.9/tools/perf 以下にソースコードがあるから、そこでsudo make すれば良いという豪快な答え。すげぇ。