Commit Graph

  • 09bc1df7ae Pass vecadd. main Colin 2024-01-31 23:54:44 +0800
  • fd56811650 Refactoring the codebase. Remove useless variables; Add comments; Remove useless header files; Remove hard code and support both x86 and ARM CPU Ruobing Han 2023-12-13 14:29:17 -0500
  • 50d615da64
    Merge pull request #1 from drcut/MICRO_dev Ruobing Han 2023-04-01 17:08:50 -0400
  • 7e029ba1d5 fix bug for AllocaInst that are wrapped in a PR Ruobing Han 2023-04-01 16:59:26 -0400
  • eb140b3fc4
    Update README.md Ruobing Han 2023-03-14 20:39:51 -0400
  • 622835aa5e update CMakeLists Ruobing Han 2022-09-30 10:44:31 -0400
  • 7755b5d61d trigger CI for verification Ruobing Han 2022-09-30 10:41:23 -0400
  • 3bc3fda11b trigger CI for verification Ruobing Han 2022-09-30 10:40:59 -0400
  • 66716b16ec trigger CI for verification Ruobing Han 2022-09-30 10:40:25 -0400
  • eadc543826 add acknowledge in README Ruobing Han 2022-09-28 09:54:30 -0400
  • 5f5a0e78d2 add acknowledge in README Ruobing Han 2022-09-28 09:54:12 -0400
  • 940420d169 update README with make test Ruobing Han 2022-09-25 21:34:01 -0400
  • 4966cb80fc update CMakelists Ruobing Han 2022-09-25 21:22:05 -0400
  • 921509d79c update CMakelists Ruobing Han 2022-09-25 21:19:33 -0400
  • 9e8b133fd6 change tests to test Ruobing Han 2022-09-25 13:37:51 -0400
  • ff74bc9695 add verbose for CTest Ruobing Han 2022-09-25 13:18:47 -0400
  • a5235206e7 update CTest Ruobing Han 2022-09-25 12:58:58 -0400
  • 118af6c074 add Hetero-mark into CTest Ruobing Han 2022-09-25 12:35:49 -0400
  • 9e40c806fb Merge branch 'PPoPP' of github.com:drcut/CuPBoP into PPoPP Ruobing Han 2022-09-23 13:08:40 -0400
  • 18b577b008 add CMake test Ruobing Han 2022-09-23 13:08:28 -0400
  • 397cb9443f
    Create LICENSE Ruobing Han 2022-09-23 09:28:00 -0400
  • 9093f802b0
    Delete LICENSE Ruobing Han 2022-09-23 09:26:32 -0400
  • 63c9cc566c avoid unnecessary extend arrays Ruobing Han 2022-09-23 09:15:10 -0400
  • c6442c8b23 modify name of extend arrays Ruobing Han 2022-09-23 09:11:45 -0400
  • ed51e5af91 apply divergence analysis for replicating local variables Ruobing Han 2022-09-22 16:24:16 -0400
  • e99205aa8b apply divergence analysis for replicating local variables Ruobing Han 2022-09-22 16:15:38 -0400
  • 8da1ecc5fd update README Han Ruobing 2022-09-22 14:53:32 -0400
  • 124e7fa0ae remove unless initilization Han Ruobing 2022-09-22 14:44:42 -0400
  • dabe03409e update CI Ruobing Han 2022-09-22 11:43:03 -0400
  • c0c3490e23 update CI Ruobing Han 2022-09-22 11:32:03 -0400
  • a8643e6981 update CI Ruobing Han 2022-09-22 11:25:33 -0400
  • c2222f2e39 update CMake to use official CUDA toolkit Ruobing Han 2022-09-22 11:20:50 -0400
  • f712c30b09 implement multistream APIs for CPU backend Ruobing Han 2022-09-19 10:41:40 -0400
  • ca089c4274 fix CI Ruobing Han 2022-09-16 09:09:57 -0400
  • 5b40786ae3 fix CI Ruobing Han 2022-09-15 21:14:29 -0400
  • 961a931f10 fix CI Ruobing Han 2022-09-15 21:04:51 -0400
  • e5f020d997 add static/dynamic shared memory example Ruobing Han 2022-09-15 20:51:53 -0400
  • 3d22cc1f36 fix bug for dynamic shared memory Ruobing Han 2022-09-15 20:38:48 -0400
  • ba2c49abdd add static shared memory example Ruobing Han 2022-09-15 18:53:13 -0400
  • 3875e179b4 update runtime and threadPool with debug tools Ruobing Han 2022-09-15 18:43:14 -0400
  • f2a4f7fe64 update HostTranslator with debug tools Ruobing Han 2022-09-15 18:19:13 -0400
  • bb3724c486 update compilation with DEBUG mode Ruobing Han 2022-09-15 12:33:28 -0400
  • 9152feb24f remove useless examples Ruobing Han 2022-09-15 11:31:58 -0400
  • 49adfd026c add vecadd example and update README.md Ruobing Han 2022-09-15 11:15:21 -0400
  • 91e94ad3a6 fix bug for segfault if without cudaSetDevice Ruobing Han 2022-09-15 11:10:44 -0400
  • ef77421142 add back O3 optimization in kernelTranslator Ruobing Han 2022-09-07 20:17:34 -0400
  • 9cbbad3c4b update CI/CD Ruobing Han 2022-09-07 19:50:49 -0400
  • 8df75daf25 update commands in CI/CD Ruobing Han 2022-09-07 19:42:59 -0400
  • e0a361f47a fix coding style issue Ruobing Han 2022-09-07 19:38:21 -0400
  • 7572e0df27 change CMakeLists to include lock-free queue Ruobing Han 2022-09-07 19:32:22 -0400
  • f67d2849a4 add external party for lock-free queue Ruobing Han 2022-09-07 19:23:51 -0400
  • e0db88fb49 edit README.md Ruobing Han 2022-09-07 19:21:14 -0400
  • cf12d604eb support CloverLeaf on LLVM14 Ruobing Han 2022-07-13 18:39:59 -0400
  • 8fddb647bd remove performance optimization in kernelTranslator Ruobing Han 2022-06-25 15:22:50 -0400
  • fc1ed8d224 remove optnone metadata Ruobing Han 2022-06-25 14:44:50 -0400
  • 57367c8348 fix bug in sync Ruobing Han 2022-06-20 23:57:51 -0400
  • c1045d8140 fix bug in CI Ruobing Han 2022-06-20 23:03:01 -0400
  • 2618bd21a7 integrate lock-free queue into CI Ruobing Han 2022-06-20 22:53:19 -0400
  • db585083bb use lock-free queue Ruobing Han 2022-06-20 22:51:12 -0400
  • cbf4cd90d8 [WIP] use lock-free queue Ruobing Han 2022-06-20 19:01:28 -0400
  • 7d29a409f6 fix bug for inserting sync after kernelLaunch RobinHan 2022-06-18 13:39:26 -0400
  • 4791dfc9c9 fix bug for hostCompilation, change function name RobinHan 2022-06-18 13:02:19 -0400
  • b189526edb update CI to LLVM14 RobinHan 2022-06-17 23:46:45 -0400
  • d7668ccd86 [WIP] migriate to LLVM14 RobinHan 2022-06-17 22:20:13 -0400
  • bcdcccecc9 update README RobinHan 2022-06-17 16:38:25 -0400
  • f6ef5436de reconstruct the code constructure Han Ruobing 2022-06-07 12:53:32 -0700
  • d17128640f
    Merge pull request #11 from jchen706/master Ruobing Han 2022-06-07 19:12:40 +0000
  • d22722909a fix dwt2d cuda version and input in SC jchen706 2022-05-25 00:10:52 -0400
  • a54b2cdf82 fix dwt2d workflow cuda version jchen706 2022-05-24 21:29:16 -0400
  • 21f298524e
    Merge branch 'cupbop:master' into master Jun Chen 2022-05-24 21:16:37 -0400
  • 197abc867d
    Merge pull request #10 from unahb/master Ruobing Han 2022-05-25 01:12:33 +0000
  • 01fcd6e0cc fix dwt2d workflow cuda version jchen706 2022-05-24 21:11:52 -0400
  • 308e9284cb fix linting issues Bhanu Garg 2022-05-24 20:43:47 -0400
  • d834f31626
    Merge pull request #6 from jchen706/master Ruobing Han 2022-05-24 23:44:43 +0000
  • 52431a983d
    Merge pull request #8 from unahb/master Ruobing Han 2022-05-24 23:38:37 +0000
  • f594a8e0c8 update .gitignore remove .DS_Store files unahb 2022-05-24 13:30:01 -0700
  • 82fe95734f fix dwt2d worflow command jchen706 2022-05-22 09:05:46 -0400
  • 44c172ae99 add dwtd workflow jchen706 2022-05-22 03:47:43 -0400
  • 1add53fa86 add dwt2d example and fixes jchen706 2022-05-21 15:55:49 -0400
  • 1d0a0a228d Merge branch 'master' of https://github.com/cupbop/CuPBoP Hyesoon Kim 2022-05-19 18:42:54 -0400
  • 5cc096e9a3 adding microbench frame Hyesoon Kim 2022-05-19 18:42:30 -0400
  • 35da215db1
    Update README.md Hyesoon Kim 2022-05-18 23:14:13 -0400
  • 4ff2041677 update README.md Han Ruobing 2022-05-06 16:15:17 -0400
  • 1b0057b56b update how to contribute Han Ruobing 2022-05-06 16:08:28 -0400
  • 4a94d1b4bb update README.md Han Ruobing 2022-05-06 16:01:12 -0400
  • e503709480 update READMe.md Han Ruobing 2022-05-04 21:11:25 -0400
  • a7969e8f74 add CI/CD for unittest Han Ruobing 2022-05-04 09:11:31 -0400
  • 055de024df Empty-Commit Han Ruobing 2022-05-04 09:04:53 -0400
  • f8e72916c1 add codebase for TACO submission Han Ruobing 2022-05-04 08:59:38 -0400
  • 897af29748 add CI Ruobing Han 2022-01-13 13:30:45 -0500
  • addf0a95b7 add backbone, including basic features for compilation RobinHan 2022-01-11 11:01:42 -0500