Jetson TX2

SoCXavier
CPUCarmel 8 core
GPUVolta 512 Core
Memory16GB 256Bit LPDDR4
L1 d-cache64K
L1 i-cache128K
L2 cache2048K
L3 cache4096K
Storage32GB eMMC

スペック

  • CPU:Carmel 8 cores
  • cpuinfo
    $ cat /proc/cpuinfo
    processor       : 0
    model name      : ARMv8 Processor rev 0 (v8l)
    BogoMIPS        : 62.50
    Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer : 0x4e
    CPU architecture: 8
    CPU variant     : 0x0
    CPU part        : 0x004
    CPU revision    : 0
    MTS version     : 43306594
    
    processor       : 1
    model name      : ARMv8 Processor rev 0 (v8l)
    BogoMIPS        : 62.50
    Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer : 0x4e
    CPU architecture: 8
    CPU variant     : 0x0
    CPU part        : 0x004
    CPU revision    : 0
    MTS version     : 43306594
    
    processor       : 2
    model name      : ARMv8 Processor rev 0 (v8l)
    BogoMIPS        : 62.50
    Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer : 0x4e
    CPU architecture: 8
    CPU variant     : 0x0
    CPU part        : 0x004
    CPU revision    : 0
    MTS version     : 43306594
    
    processor       : 3
    model name      : ARMv8 Processor rev 0 (v8l)
    BogoMIPS        : 62.50
    Features        : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer : 0x4e
    CPU architecture: 8
    CPU variant     : 0x0
    CPU part        : 0x004
    CPU revision    : 0
    MTS version     : 43306594
  • 出荷時の状態では4コアしか起動していない
  • nvpmodelを0にして全力で回せるようにする
    sudo nvpmodel -m 0
  • 8コア現れる
    $ cat /proc/cpuinfo 
    processor	: 0
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 1
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 2
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 3
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 4
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 5
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 6
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
    
    processor	: 7
    model name	: ARMv8 Processor rev 0 (v8l)
    BogoMIPS	: 62.50
    Features	: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
    CPU implementer	: 0x4e
    CPU architecture: 8
    CPU variant	: 0x0
    CPU part	: 0x004
    CPU revision	: 0
    MTS version	: 43306594
  • fphp と asimdhp がArmv8.2の証
  • lscpu
    $ lscpu
    Architecture:         aarch64
    Byte Order:           Little Endian
    CPU(s):               8
    On-line CPU(s) list:  0-3
    Off-line CPU(s) list: 4-7
    Thread(s) per core:   1
    Core(s) per socket:   2
    Socket(s):            2
    Vendor ID:            Nvidia
    Model:                0
    Model name:           ARMv8 Processor rev 0 (v8l)
    Stepping:             0x0
    CPU max MHz:          2265.6001
    CPU min MHz:          115.2000
    BogoMIPS:             62.50
    L1d cache:            64K
    L1i cache:            128K
    L2 cache:             2048K
    L3 cache:             4096K
    Flags:                fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp
  • cpufreq
    $ cat /sys/bus/cpu/devices/cpu?/cpufreq/cpuinfo_max_freq
    2265600
    2265600
    2265600
    2265600
    2265600
    2265600
    2265600
    2265600
  • kernel
    $ uname -a
    Linux jetson-0423618001106 4.9.108-tegra #1 SMP PREEMPT Wed Oct 31 15:17:21 PDT 2018 aarch64 aarch64 aarch64 GNU/Linux
  • OS
    $ lsb_release -a
    No LSB modules are available.
    Distributor ID:	Ubuntu
    Description:	Ubuntu 18.04.1 LTS
    Release:	18.04
    Codename:	bionic
  • gcc
    $ gcc --version
    gcc (Ubuntu/Linaro 7.3.0-27ubuntu1~18.04) 7.3.0
    Copyright (C) 2017 Free Software Foundation, Inc.
    This is free software; see the source for copying conditions.  There is NO
    warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
  • nvcc
    $ nvcc --version
    nvcc: NVIDIA (R) Cuda compiler driver
    Copyright (c) 2005-2018 NVIDIA Corporation
    Built on Sun_Aug_12_21:08:25_CDT_2018
    Cuda compilation tools, release 10.0, V10.0.117
  • deviceQuery
    $ /usr/local/cuda-10.0/samples/1_Utilities/deviceQuery/deviceQuery
    /usr/local/cuda-10.0/samples/1_Utilities/deviceQuery/deviceQuery Starting...
    
     CUDA Device Query (Runtime API) version (CUDART static linking)
    
    Detected 1 CUDA Capable device(s)
    
    Device 0: "Xavier"
      CUDA Driver Version / Runtime Version          10.0 / 10.0
      CUDA Capability Major/Minor version number:    7.2
      Total amount of global memory:                 15819 MBytes (16587653120 bytes)
      ( 8) Multiprocessors, ( 64) CUDA Cores/MP:     512 CUDA Cores
      GPU Max Clock rate:                            1500 MHz (1.50 GHz)
      Memory Clock rate:                             1500 Mhz
      Memory Bus Width:                              256-bit
      L2 Cache Size:                                 524288 bytes
      Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
      Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers
      Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers
      Total amount of constant memory:               65536 bytes
      Total amount of shared memory per block:       49152 bytes
      Total number of registers available per block: 65536
      Warp size:                                     32
      Maximum number of threads per multiprocessor:  2048
      Maximum number of threads per block:           1024
      Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
      Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
      Maximum memory pitch:                          2147483647 bytes
      Texture alignment:                             512 bytes
      Concurrent copy and kernel execution:          Yes with 1 copy engine(s)
      Run time limit on kernels:                     No
      Integrated GPU sharing Host Memory:            Yes
      Support host page-locked memory mapping:       Yes
      Alignment requirement for Surfaces:            Yes
      Device has ECC support:                        Disabled
      Device supports Unified Addressing (UVA):      Yes
      Device supports Compute Preemption:            Yes
      Supports Cooperative Kernel Launch:            Yes
      Supports MultiDevice Co-op Kernel Launch:      Yes
      Device PCI Domain ID / Bus ID / location ID:   0 / 0 / 0
      Compute Mode:
         < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
    
    deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.0, CUDA Runtime Version = 10.0, NumDevs = 1
    Result = PASS

Jetpack

All ON

  • nvpmodel 0 is maxN
    !$ sudo crontab -l | tail -n 2
    @reboot nvpmodel -m 0
    @reboot echo 255 > /sys/kernel/debug/tegra_fan/target_pwm

インストール

  • cmake以外はだいたい入ってた
    sudo apt-get -y install cmake ccache

perf

  • Installing Linux performance monitoring tools on Jetson TX2 - NVIDIA Developer Forums
  • perf はカーネルとぴったり合ったバージョンのlinux-toolsを入れる必要がある
  • が、Jetsonのkernelはカスタマイズ版だしなぁ、と思っていたら、ピッタリの項目があった。
  • /usr/src/linux-headers-4.9.108-tegra/kernel-4.9/tools/perf 以下にソースコードがあるから、そこでsudo make すれば良いという豪快な答え。すげぇ。

トップ   編集 凍結 差分 バックアップ 添付 複製 名前変更 リロード   新規 一覧 単語検索 最終更新   ヘルプ   最終更新のRSS
Last-modified: 2018-12-27 (木) 16:56:47 (145d)