+2018-05-18 Richard Sandiford <richard.sandiford@linaro.org>
+
+ * doc/sourcebuild.texi (scalar_all_fma): Document.
+ * tree.def (FMA_EXPR): Delete.
+ * internal-fn.def (FMA, FMS, FNMA, FNMS): New internal functions.
+ * internal-fn.c (ternary_direct): New macro.
+ (expand_ternary_optab_fn): Likewise.
+ (direct_ternary_optab_supported_p): Likewise.
+ * Makefile.in (build/genmatch.o): Depend on case-cfn-macros.h.
+ * builtins.c (fold_builtin_fma): Delete.
+ (fold_builtin_3): Don't call it.
+ * cfgexpand.c (expand_debug_expr): Remove FMA_EXPR handling.
+ * expr.c (expand_expr_real_2): Likewise.
+ * fold-const.c (operand_equal_p): Likewise.
+ (fold_ternary_loc): Likewise.
+ * gimple-pretty-print.c (dump_ternary_rhs): Likewise.
+ * gimple.c (DEFTREECODE): Likewise.
+ * gimplify.c (gimplify_expr): Likewise.
+ * optabs-tree.c (optab_for_tree_code): Likewise.
+ * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
+ * tree-eh.c (operation_could_trap_p): Likewise.
+ (stmt_could_throw_1_p): Likewise.
+ * tree-inline.c (estimate_operator_cost): Likewise.
+ * tree-pretty-print.c (dump_generic_node): Likewise.
+ (op_code_prio): Likewise.
+ * tree-ssa-loop-im.c (stmt_cost): Likewise.
+ * tree-ssa-operands.c (get_expr_operands): Likewise.
+ * tree.c (commutative_ternary_tree_code, add_expr): Likewise.
+ * fold-const-call.h (fold_fma): Delete.
+ * fold-const-call.c (fold_const_call_ssss): Handle CFN_FMS,
+ CFN_FNMA and CFN_FNMS.
+ (fold_fma): Delete.
+ * genmatch.c (combined_fn): New enum.
+ (commutative_ternary_tree_code): Remove FMA_EXPR handling.
+ (commutative_op): New function.
+ (commutate): Use it. Handle more than 2 operands.
+ (dt_operand::gen_gimple_expr): Use commutative_op.
+ (parser::parse_expr): Allow :c to be used with non-binary
+ operators if the commutative operand is known.
+ * gimple-ssa-backprop.c (backprop::process_builtin_call_use): Handle
+ CFN_FMS, CFN_FNMA and CFN_FNMS.
+ (backprop::process_assign_use): Remove FMA_EXPR handling.
+ * hsa-gen.c (gen_hsa_insns_for_operation_assignment): Likewise.
+ (gen_hsa_fma): New function.
+ (gen_hsa_insn_for_internal_fn_call): Use it for IFN_FMA, IFN_FMS,
+ IFN_FNMA and IFN_FNMS.
+ * match.pd: Add folds for IFN_FMS, IFN_FNMA and IFN_FNMS.
+ * gimple-fold.h (follow_all_ssa_edges): Declare.
+ * gimple-fold.c (follow_all_ssa_edges): New function.
+ * tree-ssa-math-opts.c (convert_mult_to_fma_1): Use the
+ gimple_build interface and use follow_all_ssa_edges to fold the result.
+ (convert_mult_to_fma): Use direct_internal_fn_supported_p
+ instead of checking for optabs directly.
+ * config/i386/i386.c (ix86_add_stmt_cost): Recognize FMAs as calls
+ rather than FMA_EXPRs.
+ * config/rs6000/rs6000.c (rs6000_gimple_fold_builtin): Create a
+ call to IFN_FMA instead of an FMA_EXPR.
+
2018-05-17 Jim Wilson <jimw@sifive.com>
* expr.c (do_tablejump): When converting index to Pmode, if we have a
$(CORETYPES_H) $(GTM_H) errors.h $(READ_MD_H) $(GENSUPPORT_H)
build/genmatch.o : genmatch.c $(BCONFIG_H) $(SYSTEM_H) \
$(CORETYPES_H) errors.h $(HASH_TABLE_H) hash-map.h $(GGC_H) is-a.h \
- tree.def builtins.def internal-fn.def
+ tree.def builtins.def internal-fn.def case-cfn-macros.h
build/gencfn-macros.o : gencfn-macros.c $(BCONFIG_H) $(SYSTEM_H) \
$(CORETYPES_H) errors.h $(HASH_TABLE_H) hash-set.h builtins.def \
internal-fn.def
+2018-05-18 Richard Sandiford <richard.sandiford@linaro.org>
+
+ * brigfrontend/brig-function.cc
+ (brig_function::get_builtin_for_hsa_opcode): Use BUILT_IN_FMA
+ for BRIG_OPCODE_FMA.
+ (brig_function::get_tree_code_for_hsa_opcode): Treat BRIG_OPCODE_FMA
+ as a call.
+
2018-05-04 Pekka Jääskeläinen <pekka.jaaskelainen@parmance.com>
* brig/brigfrontend/brig-basic-inst-handler.cc: Fix handling of NOPs.
case BRIG_OPCODE_NEXP2:
builtin = mathfn_built_in (builtin_type, BUILT_IN_EXP2);
break;
+ case BRIG_OPCODE_FMA:
case BRIG_OPCODE_NFMA:
builtin = mathfn_built_in (builtin_type, BUILT_IN_FMA);
break;
return CALL_EXPR;
else
return MAX_EXPR;
- case BRIG_OPCODE_FMA:
- return FMA_EXPR;
case BRIG_OPCODE_ABS:
return ABS_EXPR;
case BRIG_OPCODE_SHL:
/* Implement as 1/f (x). gcc should pattern detect that and
use a native instruction, if available, for it. */
return TREE_LIST;
+ case BRIG_OPCODE_FMA:
case BRIG_OPCODE_FLOOR:
case BRIG_OPCODE_CEIL:
case BRIG_OPCODE_SQRT:
return fold_build1_loc (loc, ABS_EXPR, type, arg);
}
-/* Fold a call to fma, fmaf, or fmal with arguments ARG[012]. */
-
-static tree
-fold_builtin_fma (location_t loc, tree arg0, tree arg1, tree arg2, tree type)
-{
- /* ??? Only expand to FMA_EXPR if it's directly supported. */
- if (validate_arg (arg0, REAL_TYPE)
- && validate_arg (arg1, REAL_TYPE)
- && validate_arg (arg2, REAL_TYPE)
- && optab_handler (fma_optab, TYPE_MODE (type)) != CODE_FOR_nothing)
- return fold_build3_loc (loc, FMA_EXPR, type, arg0, arg1, arg2);
-
- return NULL_TREE;
-}
-
/* Fold a call to builtin carg(a+bi) -> atan2(b,a). */
static tree
CASE_FLT_FN (BUILT_IN_SINCOS):
return fold_builtin_sincos (loc, arg0, arg1, arg2);
- CASE_FLT_FN (BUILT_IN_FMA):
- CASE_FLT_FN_FLOATN_NX (BUILT_IN_FMA):
- return fold_builtin_fma (loc, arg0, arg1, arg2, type);
-
CASE_FLT_FN (BUILT_IN_REMQUO):
if (validate_arg (arg0, REAL_TYPE)
&& validate_arg (arg1, REAL_TYPE)
+2018-05-18 Richard Sandiford <richard.sandiford@linaro.org>
+
+ * gimple-parser.c (c_parser_gimple_postfix_expression): Remove
+ __FMA handling.
+
2018-05-17 Richard Sandiford <richard.sandiford@linaro.org>
* gimple-parser.c: Include internal-fn.h.
expr.value = fold_convert (type, val);
return expr;
}
- else if (strcmp (IDENTIFIER_POINTER (id), "__FMA") == 0)
- {
- c_parser_consume_token (parser);
- auto_vec<tree> args;
-
- if (c_parser_require (parser, CPP_OPEN_PAREN, "expected %<(%>"))
- {
- c_parser_gimple_expr_list (parser, &args);
- c_parser_skip_until_found (parser, CPP_CLOSE_PAREN,
- "expected %<)%>");
- }
- if (args.length () != 3)
- {
- error_at (loc, "invalid number of operands to __FMA");
- expr.value = error_mark_node;
- return expr;
- }
- expr.value = build3_loc (loc, FMA_EXPR, TREE_TYPE (args[0]),
- args[0], args[1], args[2]);
- return expr;
- }
/* SSA name. */
unsigned version, ver_offset;
case SAD_EXPR:
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
- case FMA_EXPR:
goto ternary;
case TRUTH_ANDIF_EXPR:
}
return NULL;
- case FMA_EXPR:
- return simplify_gen_ternary (FMA, mode, inner_mode, op0, op1, op2);
-
default:
flag_unsupported:
if (flag_checking)
tree vectype = stmt_info ? stmt_vectype (stmt_info) : NULL_TREE;
int stmt_cost = - 1;
+ bool fp = false;
+ machine_mode mode = TImode;
+
+ if (vectype != NULL)
+ {
+ fp = FLOAT_TYPE_P (vectype);
+ mode = TYPE_MODE (vectype);
+ }
+
if ((kind == vector_stmt || kind == scalar_stmt)
&& stmt_info
&& stmt_info->stmt && gimple_code (stmt_info->stmt) == GIMPLE_ASSIGN)
{
tree_code subcode = gimple_assign_rhs_code (stmt_info->stmt);
- bool fp = false;
- machine_mode mode = TImode;
-
- if (vectype != NULL)
- {
- fp = FLOAT_TYPE_P (vectype);
- mode = TYPE_MODE (vectype);
- }
/*machine_mode inner_mode = mode;
if (VECTOR_MODE_P (mode))
inner_mode = GET_MODE_INNER (mode);*/
case MULT_HIGHPART_EXPR:
stmt_cost = ix86_multiplication_cost (ix86_cost, mode);
break;
- case FMA_EXPR:
- stmt_cost = ix86_vec_cost (mode,
- mode == SFmode ? ix86_cost->fmass
- : ix86_cost->fmasd,
- true);
- break;
case NEGATE_EXPR:
if (SSE_FLOAT_MODE_P (mode) && TARGET_SSE_MATH)
stmt_cost = ix86_cost->sse_op;
break;
}
}
+
+ combined_fn cfn;
+ if ((kind == vector_stmt || kind == scalar_stmt)
+ && stmt_info
+ && stmt_info->stmt
+ && (cfn = gimple_call_combined_fn (stmt_info->stmt)) != CFN_LAST)
+ switch (cfn)
+ {
+ case CFN_FMA:
+ stmt_cost = ix86_vec_cost (mode,
+ mode == SFmode ? ix86_cost->fmass
+ : ix86_cost->fmasd,
+ true);
+ break;
+ default:
+ break;
+ }
+
/* If we do elementwise loads into a vector then we are bound by
latency and execution resources for the many scalar loads
(AGU and load ports). Try to account for this by scaling the
arg1 = gimple_call_arg (stmt, 1);
tree arg2 = gimple_call_arg (stmt, 2);
lhs = gimple_call_lhs (stmt);
- gimple *g = gimple_build_assign (lhs, FMA_EXPR, arg0, arg1, arg2);
+ gcall *g = gimple_build_call_internal (IFN_FMA, 3, arg0, arg1, arg2);
+ gimple_call_set_lhs (g, lhs);
+ gimple_call_set_nothrow (g, true);
gimple_set_location (g, gimple_location (stmt));
gsi_replace (gsi, g, true);
return true;
+2018-05-18 Richard Sandiford <richard.sandiford@linaro.org>
+
+ * constexpr.c (cxx_eval_constant_expression): Remove FMA_EXPR handling.
+ (potential_constant_expression_1): Likewise.
+
2018-05-16 Marek Polacek <polacek@redhat.com>
PR c++/85363
non_constant_p, overflow_p);
break;
- case FMA_EXPR:
case VEC_PERM_EXPR:
r = cxx_eval_trinary_expression (ctx, t, lval,
non_constant_p, overflow_p);
return false;
return true;
- case FMA_EXPR:
case VEC_PERM_EXPR:
for (i = 0; i < 3; ++i)
if (!RECUR (TREE_OPERAND (t, i), true))
@item rdynamic
Target supports @option{-rdynamic}.
+@item scalar_all_fma
+Target supports all four fused multiply-add optabs for both @code{float}
+and @code{double}. These optabs are: @code{fma_optab}, @code{fms_optab},
+@code{fnma_optab} and @code{fnms_optab}.
+
@item section_anchors
Target supports section anchors.
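
For reference, the four optabs named above compute the following values.
A minimal C sketch of the semantics, using the C99 fma from <math.h>
(the ref_* helper names are illustrative, not GCC APIs):

  #include <math.h>

  /* fma computes a * b + c with a single rounding; the other three
     forms differ only in which operands are negated.  */
  static double ref_fma  (double a, double b, double c) { return fma (a, b, c); }   /* a * b + c  */
  static double ref_fms  (double a, double b, double c) { return fma (a, b, -c); }  /* a * b - c  */
  static double ref_fnma (double a, double b, double c) { return fma (-a, b, c); }  /* -a * b + c */
  static double ref_fnms (double a, double b, double c) { return fma (-a, b, -c); } /* -a * b - c */
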
expand_operands (treeop0, treeop1, subtarget, &op0, &op1, EXPAND_NORMAL);
return REDUCE_BIT_FIELD (expand_mult (mode, op0, op1, target, unsignedp));
- case FMA_EXPR:
- {
- optab opt = fma_optab;
- gimple *def0, *def2;
-
- /* If there is no insn for FMA, emit it as __builtin_fma{,f,l}
- call. */
- if (optab_handler (fma_optab, mode) == CODE_FOR_nothing)
- {
- tree fn = mathfn_built_in (TREE_TYPE (treeop0), BUILT_IN_FMA);
- tree call_expr;
-
- gcc_assert (fn != NULL_TREE);
- call_expr = build_call_expr (fn, 3, treeop0, treeop1, treeop2);
- return expand_builtin (call_expr, target, subtarget, mode, false);
- }
-
- def0 = get_def_for_expr (treeop0, NEGATE_EXPR);
- /* The multiplication is commutative - look at its 2nd operand
- if the first isn't fed by a negate. */
- if (!def0)
- {
- def0 = get_def_for_expr (treeop1, NEGATE_EXPR);
- /* Swap operands if the 2nd operand is fed by a negate. */
- if (def0)
- std::swap (treeop0, treeop1);
- }
- def2 = get_def_for_expr (treeop2, NEGATE_EXPR);
-
- op0 = op2 = NULL;
-
- if (def0 && def2
- && optab_handler (fnms_optab, mode) != CODE_FOR_nothing)
- {
- opt = fnms_optab;
- op0 = expand_normal (gimple_assign_rhs1 (def0));
- op2 = expand_normal (gimple_assign_rhs1 (def2));
- }
- else if (def0
- && optab_handler (fnma_optab, mode) != CODE_FOR_nothing)
- {
- opt = fnma_optab;
- op0 = expand_normal (gimple_assign_rhs1 (def0));
- }
- else if (def2
- && optab_handler (fms_optab, mode) != CODE_FOR_nothing)
- {
- opt = fms_optab;
- op2 = expand_normal (gimple_assign_rhs1 (def2));
- }
-
- if (op0 == NULL)
- op0 = expand_expr (treeop0, subtarget, VOIDmode, EXPAND_NORMAL);
- if (op2 == NULL)
- op2 = expand_normal (treeop2);
- op1 = expand_normal (treeop1);
-
- return expand_ternary_op (TYPE_MODE (type), opt,
- op0, op1, op2, target, 0);
- }
-
case MULT_EXPR:
/* If this is a fixed-point operation, then we cannot use the code
below because "expand_mult" doesn't support sat/no-sat fixed-point
CASE_CFN_FMA_FN:
return do_mpfr_arg3 (result, mpfr_fma, arg0, arg1, arg2, format);
+ case CFN_FMS:
+ {
+ real_value new_arg2 = real_value_negate (arg2);
+ return do_mpfr_arg3 (result, mpfr_fma, arg0, arg1, &new_arg2, format);
+ }
+
+ case CFN_FNMA:
+ {
+ real_value new_arg0 = real_value_negate (arg0);
+ return do_mpfr_arg3 (result, mpfr_fma, &new_arg0, arg1, arg2, format);
+ }
+
+ case CFN_FNMS:
+ {
+ real_value new_arg0 = real_value_negate (arg0);
+ real_value new_arg2 = real_value_negate (arg2);
+ return do_mpfr_arg3 (result, mpfr_fma, &new_arg0, arg1,
+ &new_arg2, format);
+ }
+
default:
return false;
}
return fold_const_call_1 (fn, type, arg0, arg1, arg2);
}
}
-
-/* Fold a fma operation with arguments ARG[012]. */
-
-tree
-fold_fma (location_t, tree type, tree arg0, tree arg1, tree arg2)
-{
- REAL_VALUE_TYPE result;
- if (real_cst_p (arg0)
- && real_cst_p (arg1)
- && real_cst_p (arg2)
- && do_mpfr_arg3 (&result, mpfr_fma, TREE_REAL_CST_PTR (arg0),
- TREE_REAL_CST_PTR (arg1), TREE_REAL_CST_PTR (arg2),
- REAL_MODE_FORMAT (TYPE_MODE (type))))
- return build_real (type, result);
-
- return NULL_TREE;
-}
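
The new fold-const-call.c cases above reuse mpfr_fma by negating the
relevant constant operands first, which is what makes the separate
fold_fma helper removable. The identities relied on are easy to check
from C; a small self-contained sanity test (operands chosen so every
product and sum is exactly representable, making the comparisons exact):

  #include <assert.h>
  #include <math.h>

  int
  main (void)
  {
    double a = 1.5, b = -2.25, c = 0.125;
    assert (fma (a, b, -c) == a * b - c);      /* FMS  = fma (a, b, -c)  */
    assert (fma (-a, b, c) == c - a * b);      /* FNMA = fma (-a, b, c)  */
    assert (fma (-a, b, -c) == -(a * b) - c);  /* FNMS = fma (-a, b, -c) */
    return 0;
  }
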
tree fold_const_call (combined_fn, tree, tree);
tree fold_const_call (combined_fn, tree, tree, tree);
tree fold_const_call (combined_fn, tree, tree, tree, tree);
-tree fold_fma (location_t, tree, tree, tree, tree);
tree build_cmp_result (tree type, int res);
#endif
case TRUTH_ORIF_EXPR:
return OP_SAME (0) && OP_SAME (1);
- case FMA_EXPR:
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
if (!OP_SAME (2))
return NULL_TREE;
- case FMA_EXPR:
- /* For integers we can decompose the FMA if possible. */
- if (TREE_CODE (arg0) == INTEGER_CST
- && TREE_CODE (arg1) == INTEGER_CST)
- return fold_build2_loc (loc, PLUS_EXPR, type,
- const_binop (MULT_EXPR, arg0, arg1), arg2);
- if (integer_zerop (arg2))
- return fold_build2_loc (loc, MULT_EXPR, type, arg0, arg1);
-
- return fold_fma (loc, type, arg0, arg1, arg2);
-
case VEC_PERM_EXPR:
if (TREE_CODE (arg2) == VECTOR_CST)
{
IFN_LAST
};
+enum combined_fn {
+#define DEF_BUILTIN(ENUM, N, C, T, LT, B, F, NA, AT, IM, COND) \
+ CFN_##ENUM = int (ENUM),
+#include "builtins.def"
+
+#define DEF_INTERNAL_FN(CODE, FLAGS, FNSPEC) \
+ CFN_##CODE = int (END_BUILTINS) + int (IFN_##CODE),
+#include "internal-fn.def"
+
+ CFN_LAST
+};
+
+#include "case-cfn-macros.h"
+
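
The combined_fn enum above folds built-in functions and internal
functions into a single numbering space: built-in codes keep their
values and internal function codes are biased by END_BUILTINS, so one
switch statement can match either kind. A stripped-down sketch of the
same packing scheme (hypothetical names, not GCC's):

  /* Two independent enums folded into one index space.  */
  enum builtin_code  { BI_FMA, BI_SQRT, END_BI };    /* from builtins.def    */
  enum internal_code { IC_FMS, IC_FNMA, IC_LAST };   /* from internal-fn.def */
  enum combined_code
  {
    C_BI_FMA = BI_FMA,                      /* built-ins keep their values */
    C_IC_FMS = (int) END_BI + (int) IC_FMS  /* internal fns are offset     */
  };
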
/* Return true if CODE represents a commutative tree code. Otherwise
return false. */
bool
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
case DOT_PROD_EXPR:
- case FMA_EXPR:
return true;
default:
return id->kind == id_base::USER;
}
+/* If ID has a pair of consecutive, commutative operands, return the
+ index of the first, otherwise return -1. */
+
+static int
+commutative_op (id_base *id)
+{
+ if (operator_id *code = dyn_cast <operator_id *> (id))
+ {
+ if (commutative_tree_code (code->code)
+ || commutative_ternary_tree_code (code->code))
+ return 0;
+ return -1;
+ }
+ if (fn_id *fn = dyn_cast <fn_id *> (id))
+ switch (fn->fn)
+ {
+ CASE_CFN_FMA:
+ case CFN_FMS:
+ case CFN_FNMA:
+ case CFN_FNMS:
+ return 0;
+
+ default:
+ return -1;
+ }
+ if (user_id *uid = dyn_cast<user_id *> (id))
+ {
+ int res = commutative_op (uid->substitutes[0]);
+ if (res < 0)
+ return -1;
+ for (unsigned i = 1; i < uid->substitutes.length (); ++i)
+ if (res != commutative_op (uid->substitutes[i]))
+ return -1;
+ return res;
+ }
+ return -1;
+}
+
/* Add a predicate identifier to the hash. */
static predicate_id *
if (!e->is_commutative)
return ret;
+ /* The operation is always binary if it isn't inherently commutative. */
+ int natural_opno = commutative_op (e->operation);
+ unsigned int opno = natural_opno >= 0 ? natural_opno : 0;
for (unsigned i = 0; i < result.length (); ++i)
{
expr *ne = new expr (e);
}
}
ne->is_commutative = false;
- // result[i].length () is 2 since e->operation is binary
- for (unsigned j = result[i].length (); j; --j)
- ne->append_op (result[i][j-1]);
+ for (unsigned j = 0; j < result[i].length (); ++j)
+ {
+ int old_j = (j == opno ? opno + 1 : j == opno + 1 ? opno : j);
+ ne->append_op (result[i][old_j]);
+ }
ret.safe_push (ne);
}
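
The old loop simply reversed the two operands of a binary expression;
the new one swaps only the commutative pair (opno, opno + 1) and copies
every other operand through unchanged. The index mapping can be checked
in isolation:

  #include <stdio.h>

  int
  main (void)
  {
    unsigned int opno = 0;  /* FMA-style: operands 0 and 1 commute.  */
    for (unsigned int j = 0; j < 3; ++j)
      {
        unsigned int old_j = (j == opno ? opno + 1
                              : j == opno + 1 ? opno : j);
        printf ("new operand %u takes old operand %u\n", j, old_j);
      }
    return 0;
  }
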
/* While the toplevel operands are canonicalized by the caller
after valueizing operands of sub-expressions we have to
re-canonicalize operand order. */
- if (operator_id *code = dyn_cast <operator_id *> (id))
+ int opno = commutative_op (id);
+ if (opno >= 0)
{
- /* ??? We can't canonicalize tcc_comparison operands here
- because that requires changing the comparison code which
- we already matched... */
- if (commutative_tree_code (code->code)
- || commutative_ternary_tree_code (code->code))
- {
- char child_opname0[20], child_opname1[20];
- gen_opname (child_opname0, 0);
- gen_opname (child_opname1, 1);
- fprintf_indent (f, indent,
- "if (tree_swap_operands_p (%s, %s))\n",
- child_opname0, child_opname1);
- fprintf_indent (f, indent,
- " std::swap (%s, %s);\n",
- child_opname0, child_opname1);
- }
+ char child_opname0[20], child_opname1[20];
+ gen_opname (child_opname0, opno);
+ gen_opname (child_opname1, opno + 1);
+ fprintf_indent (f, indent,
+ "if (tree_swap_operands_p (%s, %s))\n",
+ child_opname0, child_opname1);
+ fprintf_indent (f, indent,
+ " std::swap (%s, %s);\n",
+ child_opname0, child_opname1);
}
return n_braces;
e->operation->id, e->operation->nargs, e->ops.length ());
if (is_commutative)
{
- if (e->ops.length () == 2)
+ if (e->ops.length () == 2
+ || commutative_op (e->operation) >= 0)
e->is_commutative = true;
else
- fatal_at (token, "only binary operators or function with "
- "two arguments can be marked commutative");
+ fatal_at (token, "only binary operators or functions with "
+ "two arguments can be marked commutative, "
+ "unless the operation is known to be inherently "
+ "commutative");
}
e->expr_type = expr_type;
return op;
return val;
}
+/* Valueization callback that follows all SSA edges. */
+
+tree
+follow_all_ssa_edges (tree val)
+{
+ return val;
+}
+
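
The valueization contract: the folders call the callback on each SSA
name they reach, and a non-null result tells them they may look through
that name's definition. Returning the name itself therefore follows
every edge, while no_follow_ssa_edges returns NULL_TREE to keep names
opaque. A sketch of the two extremes (the tree typedef is stubbed out
for illustration):

  typedef struct tree_node *tree;  /* stub for illustration only */
  #define NULL_TREE ((tree) 0)

  static tree
  follow_all (tree val)   /* mirrors follow_all_ssa_edges */
  {
    return val;
  }

  static tree
  follow_none (tree val)  /* mirrors no_follow_ssa_edges */
  {
    (void) val;
    return NULL_TREE;
  }
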
/* Fold the statement pointed to by GSI. In some cases, this function may
replace the whole statement with a new one. Returns true iff folding
makes any changes.
const_tree);
extern tree no_follow_ssa_edges (tree);
extern tree follow_single_use_edges (tree);
+extern tree follow_all_ssa_edges (tree);
extern tree gimple_fold_stmt_to_constant_1 (gimple *, tree (*) (tree),
tree (*) (tree) = no_follow_ssa_edges);
extern tree gimple_fold_stmt_to_constant (gimple *, tree (*) (tree));
pp_greater (buffer);
break;
- case FMA_EXPR:
- if (flags & TDF_GIMPLE)
- {
- pp_string (buffer, "__FMA (");
- dump_generic_node (buffer, gimple_assign_rhs1 (gs), spc, flags, false);
- pp_comma (buffer);
- dump_generic_node (buffer, gimple_assign_rhs2 (gs), spc, flags, false);
- pp_comma (buffer);
- dump_generic_node (buffer, gimple_assign_rhs3 (gs), spc, flags, false);
- pp_right_paren (buffer);
- }
- else
- {
- dump_generic_node (buffer, gimple_assign_rhs1 (gs), spc, flags, false);
- pp_string (buffer, " * ");
- dump_generic_node (buffer, gimple_assign_rhs2 (gs), spc, flags, false);
- pp_string (buffer, " + ");
- dump_generic_node (buffer, gimple_assign_rhs3 (gs), spc, flags, false);
- }
- break;
-
case DOT_PROD_EXPR:
pp_string (buffer, "DOT_PROD_EXPR <");
dump_generic_node (buffer, gimple_assign_rhs1 (gs), spc, flags, false);
CASE_CFN_FMA:
CASE_CFN_FMA_FN:
+ case CFN_FMS:
+ case CFN_FNMA:
+ case CFN_FNMS:
/* In X * X + Y, where Y is distinct from X, the sign of X doesn't
matter. */
if (gimple_call_arg (call, 0) == rhs
}
break;
- case FMA_EXPR:
- /* In X * X + Y, where Y is distinct from X, the sign of X doesn't
- matter. */
- if (gimple_assign_rhs1 (assign) == rhs
- && gimple_assign_rhs2 (assign) == rhs
- && gimple_assign_rhs3 (assign) != rhs)
- info->flags.ignore_sign = true;
- break;
-
case MULT_EXPR:
/* In X * X, the sign of X doesn't matter. */
if (gimple_assign_rhs1 (assign) == rhs
|| (SYM) == REALIGN_LOAD_EXPR \
|| (SYM) == VEC_COND_EXPR \
|| (SYM) == VEC_PERM_EXPR \
- || (SYM) == BIT_INSERT_EXPR \
- || (SYM) == FMA_EXPR) ? GIMPLE_TERNARY_RHS \
+ || (SYM) == BIT_INSERT_EXPR) ? GIMPLE_TERNARY_RHS \
: ((SYM) == CONSTRUCTOR \
|| (SYM) == OBJ_TYPE_REF \
|| (SYM) == ASSERT_EXPR \
}
break;
- case FMA_EXPR:
case VEC_PERM_EXPR:
/* Classified as tcc_expression. */
goto expr_3;
case NEGATE_EXPR:
opcode = BRIG_OPCODE_NEG;
break;
- case FMA_EXPR:
- /* There is a native HSA instruction for scalar FMAs but not for vector
- ones. */
- if (TREE_CODE (TREE_TYPE (lhs)) == VECTOR_TYPE)
- {
- hsa_op_reg *dest
- = hsa_cfun->reg_for_gimple_ssa (gimple_assign_lhs (assign));
- hsa_op_with_type *op1 = hsa_reg_or_immed_for_gimple_op (rhs1, hbb);
- hsa_op_with_type *op2 = hsa_reg_or_immed_for_gimple_op (rhs2, hbb);
- hsa_op_with_type *op3 = hsa_reg_or_immed_for_gimple_op (rhs3, hbb);
- hsa_op_reg *tmp = new hsa_op_reg (dest->m_type);
- gen_hsa_binary_operation (BRIG_OPCODE_MUL, tmp, op1, op2, hbb);
- gen_hsa_binary_operation (BRIG_OPCODE_ADD, dest, tmp, op3, hbb);
- return;
- }
- opcode = BRIG_OPCODE_MAD;
- break;
case MIN_EXPR:
opcode = BRIG_OPCODE_MIN;
break;
insn->set_output_in_type (dest, 0, hbb);
}
+/* Emit instructions that implement FMA, FMS, FNMA or FNMS call STMT.
+ Instructions are appended to basic block HBB. NEGATE1 is true for
+ FNMA and FNMS. NEGATE3 is true for FMS and FNMS. */
+
+static void
+gen_hsa_fma (gcall *call, hsa_bb *hbb, bool negate1, bool negate3)
+{
+ tree lhs = gimple_call_lhs (call);
+ if (lhs == NULL_TREE)
+ return;
+
+ tree rhs1 = gimple_call_arg (call, 0);
+ tree rhs2 = gimple_call_arg (call, 1);
+ tree rhs3 = gimple_call_arg (call, 2);
+
+ hsa_op_reg *dest = hsa_cfun->reg_for_gimple_ssa (lhs);
+ hsa_op_with_type *op1 = hsa_reg_or_immed_for_gimple_op (rhs1, hbb);
+ hsa_op_with_type *op2 = hsa_reg_or_immed_for_gimple_op (rhs2, hbb);
+ hsa_op_with_type *op3 = hsa_reg_or_immed_for_gimple_op (rhs3, hbb);
+
+ if (negate1)
+ {
+ hsa_op_reg *tmp = new hsa_op_reg (dest->m_type);
+ gen_hsa_unary_operation (BRIG_OPCODE_NEG, tmp, op1, hbb);
+ op1 = tmp;
+ }
+
+ /* There is a native HSA instruction for scalar FMAs but not for vector
+ ones. */
+ if (TREE_CODE (TREE_TYPE (lhs)) == VECTOR_TYPE)
+ {
+ hsa_op_reg *tmp = new hsa_op_reg (dest->m_type);
+ gen_hsa_binary_operation (BRIG_OPCODE_MUL, tmp, op1, op2, hbb);
+ gen_hsa_binary_operation (negate3 ? BRIG_OPCODE_SUB : BRIG_OPCODE_ADD,
+ dest, tmp, op3, hbb);
+ }
+ else
+ {
+ if (negate3)
+ {
+ hsa_op_reg *tmp = new hsa_op_reg (dest->m_type);
+ gen_hsa_unary_operation (BRIG_OPCODE_NEG, tmp, op3, hbb);
+ op3 = tmp;
+ }
+ hsa_insn_basic *insn = new hsa_insn_basic (4, BRIG_OPCODE_MAD,
+ dest->m_type, dest,
+ op1, op2, op3);
+ hbb->append_insn (insn);
+ }
+}
+
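
The two flags encode which operands gen_hsa_fma negates: NEGATE1 for the
FNMA/FNMS forms and NEGATE3 for the FMS/FNMS forms. A plain-C reference
for the value each call computes (illustrative only; the scalar HSA path
emits this as a single MAD instruction rather than separate operations):

  static double
  ref_hsa_fma (double a, double b, double c, int negate1, int negate3)
  {
    if (negate1)  /* IFN_FNMA and IFN_FNMS */
      a = -a;
    if (negate3)  /* IFN_FMS and IFN_FNMS */
      c = -c;
    return a * b + c;
  }
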
/* Set VALUE to a shadow kernel debug argument and append a new instruction
to HBB basic block. */
gen_hsa_insns_for_call_of_internal_fn (stmt, hbb);
break;
+ case IFN_FMA:
+ gen_hsa_fma (stmt, hbb, false, false);
+ break;
+
+ case IFN_FMS:
+ gen_hsa_fma (stmt, hbb, false, true);
+ break;
+
+ case IFN_FNMA:
+ gen_hsa_fma (stmt, hbb, true, false);
+ break;
+
+ case IFN_FNMS:
+ gen_hsa_fma (stmt, hbb, true, true);
+ break;
+
default:
HSA_SORRY_ATV (gimple_location (stmt),
"support for HSA does not implement internal function: %s",
#define scatter_store_direct { 3, 3, false }
#define unary_direct { 0, 0, true }
#define binary_direct { 0, 0, true }
+#define ternary_direct { 0, 0, true }
#define cond_unary_direct { 1, 1, true }
#define cond_binary_direct { 1, 1, true }
#define while_direct { 0, 2, false }
#define expand_binary_optab_fn(FN, STMT, OPTAB) \
expand_direct_optab_fn (FN, STMT, OPTAB, 2)
+#define expand_ternary_optab_fn(FN, STMT, OPTAB) \
+ expand_direct_optab_fn (FN, STMT, OPTAB, 3)
+
#define expand_cond_unary_optab_fn(FN, STMT, OPTAB) \
expand_direct_optab_fn (FN, STMT, OPTAB, 2)
#define direct_unary_optab_supported_p direct_optab_supported_p
#define direct_binary_optab_supported_p direct_optab_supported_p
+#define direct_ternary_optab_supported_p direct_optab_supported_p
#define direct_cond_unary_optab_supported_p direct_optab_supported_p
#define direct_cond_binary_optab_supported_p direct_optab_supported_p
#define direct_mask_load_optab_supported_p direct_optab_supported_p
- unary: a normal unary optab, such as vec_reverse_<mode>
- binary: a normal binary optab, such as vec_interleave_lo_<mode>
+ - ternary: a normal ternary optab, such as fma<mode>4
- cond_binary: a conditional binary optab, such as add<mode>cc
DEF_INTERNAL_OPTAB_FN (VEC_SHL_INSERT, ECF_CONST | ECF_NOTHROW,
vec_shl_insert, binary)
+DEF_INTERNAL_OPTAB_FN (FMS, ECF_CONST, fms, ternary)
+DEF_INTERNAL_OPTAB_FN (FNMA, ECF_CONST, fnma, ternary)
+DEF_INTERNAL_OPTAB_FN (FNMS, ECF_CONST, fnms, ternary)
+
DEF_INTERNAL_OPTAB_FN (COND_ADD, ECF_CONST, cond_add, cond_binary)
DEF_INTERNAL_OPTAB_FN (COND_SUB, ECF_CONST, cond_sub, cond_binary)
DEF_INTERNAL_SIGNED_OPTAB_FN (COND_MIN, ECF_CONST, first,
/* FP scales. */
DEF_INTERNAL_FLT_FN (LDEXP, ECF_CONST, ldexp, binary)
+/* Ternary math functions. */
+DEF_INTERNAL_FLT_FN (FMA, ECF_CONST, fma, ternary)
+
/* Unary integer ops. */
DEF_INTERNAL_INT_FN (CLRSB, ECF_CONST | ECF_NOTHROW, clrsb, unary)
DEF_INTERNAL_INT_FN (CLZ, ECF_CONST | ECF_NOTHROW, clz, unary)
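
The { 0, 0, true } value of ternary_direct follows the same convention
as the unary and binary entries: both mode-determining type indices
refer to operand 0, so a single mode selects the optab, and the function
is vectorizable. The payoff is that each new function above is one
table-driven line rather than hand-written expansion code; a sketch of
the idea (hypothetical names, not GCC's internal-fn machinery):

  #include <stdio.h>

  /* Each direct function is fully described by an optab name and an
     arity, so support checks and expansion need no per-function code.  */
  struct direct_fn_info { const char *optab; int arity; };

  static const struct direct_fn_info fma_family[] = {
    { "fma",  3 },   /* IFN_FMA  */
    { "fms",  3 },   /* IFN_FMS  */
    { "fnma", 3 },   /* IFN_FNMA */
    { "fnms", 3 },   /* IFN_FNMS */
  };

  int
  main (void)
  {
    for (unsigned int i = 0; i < sizeof fma_family / sizeof fma_family[0]; ++i)
      printf ("%s: %d operands, mode keyed off operand 0\n",
              fma_family[i].optab, fma_family[i].arity);
    return 0;
  }
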
|| wi::geu_p (wi::to_wide (@rpos),
wi::to_wide (@ipos) + isize))
(BIT_FIELD_REF @0 @rsize @rpos)))))
+
+(for fmas (FMA)
+ (simplify
+ (fmas:c (negate @0) @1 @2)
+ (IFN_FNMA @0 @1 @2))
+ (simplify
+ (fmas @0 @1 (negate @2))
+ (IFN_FMS @0 @1 @2))
+ (simplify
+ (fmas:c (negate @0) @1 (negate @2))
+ (IFN_FNMS @0 @1 @2))
+ (simplify
+ (negate (fmas@3 @0 @1 @2))
+ (if (single_use (@3))
+ (IFN_FNMS @0 @1 @2))))
+
+(simplify
+ (IFN_FMS:c (negate @0) @1 @2)
+ (IFN_FNMS @0 @1 @2))
+(simplify
+ (IFN_FMS @0 @1 (negate @2))
+ (IFN_FMA @0 @1 @2))
+(simplify
+ (IFN_FMS:c (negate @0) @1 (negate @2))
+ (IFN_FNMA @0 @1 @2))
+(simplify
+ (negate (IFN_FMS@3 @0 @1 @2))
+ (if (single_use (@3))
+ (IFN_FNMA @0 @1 @2)))
+
+(simplify
+ (IFN_FNMA:c (negate @0) @1 @2)
+ (IFN_FMA @0 @1 @2))
+(simplify
+ (IFN_FNMA @0 @1 (negate @2))
+ (IFN_FNMS @0 @1 @2))
+(simplify
+ (IFN_FNMA:c (negate @0) @1 (negate @2))
+ (IFN_FMS @0 @1 @2))
+(simplify
+ (negate (IFN_FNMA@3 @0 @1 @2))
+ (if (single_use (@3))
+ (IFN_FMS @0 @1 @2)))
+
+(simplify
+ (IFN_FNMS:c (negate @0) @1 @2)
+ (IFN_FMS @0 @1 @2))
+(simplify
+ (IFN_FNMS @0 @1 (negate @2))
+ (IFN_FNMA @0 @1 @2))
+(simplify
+ (IFN_FNMS:c (negate @0) @1 (negate @2))
+ (IFN_FMA @0 @1 @2))
+(simplify
+ (negate (IFN_FNMS@3 @0 @1 @2))
+ (if (single_use (@3))
+ (IFN_FMA @0 @1 @2)))
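
Together these folds let a negation anywhere around a fused multiply-add
rotate the operation through the four forms. A runnable check of a few
of the identities (exact operand values, so single versus double
rounding cannot make the comparisons differ):

  #include <assert.h>
  #include <math.h>

  int
  main (void)
  {
    double a = 3.0, b = 0.5, c = 7.0;
    /* (negate (FMA a b c)) -> (FNMS a b c): -(a*b + c) == -a*b - c.  */
    assert (-fma (a, b, c) == fma (-a, b, -c));
    /* (FMA (negate a) b c) -> (FNMA a b c): both compute c - a*b.  */
    assert (fma (-a, b, c) == c - a * b);
    /* (FMA a b (negate c)) -> (FMS a b c): both compute a*b - c.  */
    assert (fma (a, b, -c) == a * b - c);
    return 0;
  }
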
: (TYPE_SATURATING (type)
? ssmsub_widen_optab : smsub_widen_optab));
- case FMA_EXPR:
- return fma_optab;
-
case VEC_WIDEN_MULT_HI_EXPR:
return TYPE_UNSIGNED (type) ?
vec_widen_umult_hi_optab : vec_widen_smult_hi_optab;
+2018-05-18 Richard Sandiford <richard.sandiford@linaro.org>
+
+ * lib/target-supports.exp (check_effective_target_scalar_all_fma):
+ New proc.
+ * gcc.dg/fma-1.c: New test.
+ * gcc.dg/fma-2.c: Likewise.
+ * gcc.dg/fma-3.c: Likewise.
+ * gcc.dg/fma-4.c: Likewise.
+ * gcc.dg/fma-5.c: Likewise.
+ * gcc.dg/fma-6.c: Likewise.
+ * gcc.dg/fma-7.c: Likewise.
+ * gcc.dg/gimplefe-26.c: Use .FMA instead of __FMA and require
+ scalar_all_fma.
+ * gfortran.dg/reassoc_7.f: Pass -ffp-contract=off.
+ * gfortran.dg/reassoc_8.f: Likewise.
+ * gfortran.dg/reassoc_9.f: Likewise.
+ * gfortran.dg/reassoc_10.f: Likewise.
+
2018-05-17 Jim Wilson <jimw@sifive.com>
* gcc.target/riscv/switch-qi.c: New.
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-widening_mul" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return a * b + c;
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return a * b + c;
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FMA \(} 2 "widening_mul" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-widening_mul" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return a * b - c;
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return a * b - c;
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FMS \(} 2 "widening_mul" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-widening_mul" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return c - a * b;
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return c - a * b;
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FNMA \(} 2 "widening_mul" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-widening_mul" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return -(a * b) - c;
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return -(a * b) - c;
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FNMS \(} 2 "widening_mul" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-optimized" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return __builtin_fmaf (a, b, -c);
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return __builtin_fma (a, b, -c);
+}
+
+void
+f3 (float a, float b, float c, float d, float e, float *res)
+{
+ res[0] = __builtin_fmaf (a, b, -e);
+ res[1] = __builtin_fmaf (c, d, -e);
+}
+
+void
+f4 (double a, double b, double c, double d, double e, double *res)
+{
+ res[0] = __builtin_fma (a, b, -e);
+ res[1] = __builtin_fma (c, d, -e);
+}
+
+float
+f5 (float a, float b, float c)
+{
+ return -__builtin_fmaf (-a, b, c);
+}
+
+double
+f6 (double a, double b, double c)
+{
+ return -__builtin_fma (-a, b, c);
+}
+
+float
+f7 (float a, float b, float c)
+{
+ return -__builtin_fmaf (a, -b, c);
+}
+
+double
+f8 (double a, double b, double c)
+{
+ return -__builtin_fma (a, -b, c);
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FMS \(} 10 "optimized" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-optimized" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return __builtin_fmaf (-a, b, c);
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return __builtin_fma (-a, b, c);
+}
+
+float
+f3 (float a, float b, float c)
+{
+ return __builtin_fmaf (a, -b, c);
+}
+
+double
+f4 (double a, double b, double c)
+{
+ return __builtin_fma (a, -b, c);
+}
+
+void
+f5 (float a, float b, float c, float d, float e, float *res)
+{
+ res[0] = __builtin_fmaf (-a, b, c);
+ res[1] = __builtin_fmaf (-a, d, e);
+}
+
+void
+f6 (double a, double b, double c, double d, double e, double *res)
+{
+ res[0] = __builtin_fma (-a, b, c);
+ res[1] = __builtin_fma (-a, d, e);
+}
+
+void
+f7 (float a, float b, float c, float d, float e, float *res)
+{
+ res[0] = __builtin_fmaf (a, -b, c);
+ res[1] = __builtin_fmaf (d, -b, e);
+}
+
+void
+f8 (double a, double b, double c, double d, double e, double *res)
+{
+ res[0] = __builtin_fma (a, -b, c);
+ res[1] = __builtin_fma (d, -b, e);
+}
+
+float
+f9 (float a, float b, float c)
+{
+ return -__builtin_fmaf (a, b, -c);
+}
+
+double
+f10 (double a, double b, double c)
+{
+ return -__builtin_fma (a, b, -c);
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FNMA \(} 14 "optimized" { target scalar_all_fma } } } */
--- /dev/null
+/* { dg-options "-O2 -fdump-tree-optimized" } */
+
+float
+f1 (float a, float b, float c)
+{
+ return __builtin_fmaf (-a, b, -c);
+}
+
+double
+f2 (double a, double b, double c)
+{
+ return __builtin_fma (-a, b, -c);
+}
+
+float
+f3 (float a, float b, float c)
+{
+ return __builtin_fmaf (a, -b, -c);
+}
+
+double
+f4 (double a, double b, double c)
+{
+ return __builtin_fma (a, -b, -c);
+}
+
+void
+f5 (float a, float b, float c, float d, float *res)
+{
+ res[0] = __builtin_fmaf (-a, b, -c);
+ res[1] = __builtin_fmaf (-a, d, -c);
+}
+
+void
+f6 (double a, double b, double c, double d, double *res)
+{
+ res[0] = __builtin_fma (-a, b, -c);
+ res[1] = __builtin_fma (-a, d, -c);
+}
+
+void
+f7 (float a, float b, float c, float d, float *res)
+{
+ res[0] = __builtin_fmaf (a, -b, -c);
+ res[1] = __builtin_fmaf (d, -b, -c);
+}
+
+void
+f8 (double a, double b, double c, double d, double *res)
+{
+ res[0] = __builtin_fma (a, -b, -c);
+ res[1] = __builtin_fma (d, -b, -c);
+}
+
+float
+f9 (float a, float b, float c)
+{
+ return -__builtin_fmaf (a, b, c);
+}
+
+double
+f10 (double a, double b, double c)
+{
+ return -__builtin_fma (a, b, c);
+}
+
+/* { dg-final { scan-tree-dump-times { = \.FNMS \(} 14 "optimized" { target scalar_all_fma } } } */
-/* { dg-do compile { target c99_runtime } } */
+/* { dg-do compile { target scalar_all_fma } } */
/* { dg-options "-O -fgimple -fdump-tree-ssa-gimple" } */
#define foo(type, num) \
type __GIMPLE () foo_##num (type a, type b, type c) \
{ \
type t0; \
- t0_1 = __FMA (a, b, c); \
+ t0_1 = .FMA (a, b, c); \
return t0_1; \
}
foo(float, 1)
foo(double, 2)
-foo(long double, 3)
-/* { dg-final { scan-tree-dump-times "__FMA" 3 "ssa" } } */
+/* { dg-final { scan-tree-dump-times {\.FMA} 2 "ssa" } } */
! { dg-do compile }
-! { dg-options "-O3 -ffast-math -fdump-tree-optimized" }
+! { dg-options "-O3 -ffast-math -ffp-contract=off -fdump-tree-optimized" }
SUBROUTINE S55199(P,Q,Dvdph)
implicit none
! { dg-do compile }
-! { dg-options "-O3 -ffast-math -fdump-tree-optimized" }
+! { dg-options "-O3 -ffast-math -ffp-contract=off -fdump-tree-optimized" }
SUBROUTINE S55199(P,Dvdph)
implicit none
! { dg-do compile }
-! { dg-options "-O3 -ffast-math -fdump-tree-optimized" }
+! { dg-options "-O3 -ffast-math -ffp-contract=off -fdump-tree-optimized" }
SUBROUTINE S55199(P,Dvdph)
implicit none
! { dg-do compile }
-! { dg-options "-O3 -ffast-math -fdump-tree-optimized" }
+! { dg-options "-O3 -ffast-math -ffp-contract=off -fdump-tree-optimized" }
SUBROUTINE S55199(P,Dvdph)
implicit none
return 1
}
+# Return 1 if the target supports all four forms of fused multiply-add
+# (fma, fms, fnma, and fnms) for both float and double.
+
+proc check_effective_target_scalar_all_fma { } {
+ return [istarget aarch64*-*-*]
+}
+
# Return 1 if the target supports compiling fixed-point,
# 0 otherwise.
}
break;
- case FMA_EXPR:
- if (!useless_type_conversion_p (lhs_type, rhs1_type)
- || !useless_type_conversion_p (lhs_type, rhs2_type)
- || !useless_type_conversion_p (lhs_type, rhs3_type))
- {
- error ("type mismatch in fused multiply-add expression");
- debug_generic_expr (lhs_type);
- debug_generic_expr (rhs1_type);
- debug_generic_expr (rhs2_type);
- debug_generic_expr (rhs3_type);
- return true;
- }
- break;
-
case VEC_COND_EXPR:
if (!VECTOR_BOOLEAN_TYPE_P (rhs1_type)
|| maybe_ne (TYPE_VECTOR_SUBPARTS (rhs1_type),
if (TREE_CODE_CLASS (op) != tcc_comparison
&& TREE_CODE_CLASS (op) != tcc_unary
- && TREE_CODE_CLASS (op) != tcc_binary
- && op != FMA_EXPR)
+ && TREE_CODE_CLASS (op) != tcc_binary)
return false;
return operation_could_trap_helper_p (op, fp_operation, honor_trapv,
if (TREE_CODE_CLASS (code) == tcc_comparison
|| TREE_CODE_CLASS (code) == tcc_unary
- || TREE_CODE_CLASS (code) == tcc_binary
- || code == FMA_EXPR)
+ || TREE_CODE_CLASS (code) == tcc_binary)
{
if (TREE_CODE_CLASS (code) == tcc_comparison)
t = TREE_TYPE (gimple_assign_rhs1 (stmt));
case MINUS_EXPR:
case MULT_EXPR:
case MULT_HIGHPART_EXPR:
- case FMA_EXPR:
case ADDR_SPACE_CONVERT_EXPR:
case FIXED_CONVERT_EXPR:
pp_string (pp, " > ");
break;
- case FMA_EXPR:
- pp_string (pp, " FMA_EXPR < ");
- dump_generic_node (pp, TREE_OPERAND (node, 0), spc, flags, false);
- pp_string (pp, ", ");
- dump_generic_node (pp, TREE_OPERAND (node, 1), spc, flags, false);
- pp_string (pp, ", ");
- dump_generic_node (pp, TREE_OPERAND (node, 2), spc, flags, false);
- pp_string (pp, " > ");
- break;
-
case OACC_PARALLEL:
pp_string (pp, "#pragma acc parallel");
goto dump_omp_clauses_body;
case CEIL_MOD_EXPR:
case FLOOR_MOD_EXPR:
case ROUND_MOD_EXPR:
- case FMA_EXPR:
return 13;
case TRUTH_NOT_EXPR:
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
case DOT_PROD_EXPR:
- case FMA_EXPR:
case TRUNC_DIV_EXPR:
case CEIL_DIV_EXPR:
case FLOOR_DIV_EXPR:
tree type = TREE_TYPE (mul_result);
gimple *use_stmt;
imm_use_iterator imm_iter;
- gassign *fma_stmt;
+ gcall *fma_stmt;
FOR_EACH_IMM_USE_STMT (use_stmt, imm_iter, mul_result)
{
enum tree_code use_code;
tree addop, mulop1 = op1, result = mul_result;
bool negate_p = false;
+ gimple_seq seq = NULL;
if (is_gimple_debug (use_stmt))
continue;
addop = gimple_assign_rhs2 (use_stmt);
/* a * b - c -> a * b + (-c) */
if (gimple_assign_rhs_code (use_stmt) == MINUS_EXPR)
- addop = force_gimple_operand_gsi (&gsi,
- build1 (NEGATE_EXPR,
- type, addop),
- true, NULL_TREE, true,
- GSI_SAME_STMT);
+ addop = gimple_build (&seq, NEGATE_EXPR, type, addop);
}
else
{
}
if (negate_p)
- mulop1 = force_gimple_operand_gsi (&gsi,
- build1 (NEGATE_EXPR,
- type, mulop1),
- true, NULL_TREE, true,
- GSI_SAME_STMT);
+ mulop1 = gimple_build (&seq, NEGATE_EXPR, type, mulop1);
- fma_stmt = gimple_build_assign (gimple_assign_lhs (use_stmt),
- FMA_EXPR, mulop1, op2, addop);
+ if (seq)
+ gsi_insert_seq_before (&gsi, seq, GSI_SAME_STMT);
+ fma_stmt = gimple_build_call_internal (IFN_FMA, 3, mulop1, op2, addop);
+ gimple_call_set_lhs (fma_stmt, gimple_assign_lhs (use_stmt));
+ gimple_call_set_nothrow (fma_stmt, !stmt_can_throw_internal (use_stmt));
+ gsi_replace (&gsi, fma_stmt, true);
+ /* Follow all SSA edges so that we generate FMS, FNMA and FNMS
+ regardless of where the negation occurs. */
+ if (fold_stmt (&gsi, follow_all_ssa_edges))
+ update_stmt (gsi_stmt (gsi));
if (dump_file && (dump_flags & TDF_DETAILS))
{
fprintf (dump_file, "Generated FMA ");
- print_gimple_stmt (dump_file, fma_stmt, 0, 0);
+ print_gimple_stmt (dump_file, gsi_stmt (gsi), 0, 0);
fprintf (dump_file, "\n");
}
- gsi_replace (&gsi, fma_stmt, true);
widen_mul_stats.fmas_inserted++;
}
}
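
At the source level, folding through all SSA edges means the negation no
longer has to feed the multiply-add directly: the match.pd patterns look
through the SSA name to its defining statement, so cases that the
deleted expr.c code only recovered during RTL expansion are now folded
at the GIMPLE level. A sketch of such a case (illustrative; subject to
target support and the usual -ffp-contract rules):

  double
  f (double a, double b, double c)
  {
    double na = -a;      /* negation defined in its own statement */
    return na * b + c;   /* still becomes a single .FNMA call */
  }
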
/* If the target doesn't support it, don't generate it. We assume that
if fma isn't available then fms, fnma or fnms are not either. */
- if (optab_handler (fma_optab, TYPE_MODE (type)) == CODE_FOR_nothing)
+ optimization_type opt_type = bb_optimization_type (gimple_bb (mul_stmt));
+ if (!direct_internal_fn_supported_p (IFN_FMA, type, opt_type))
return false;
/* If the multiplication has zero uses, it is kept around probably because
that a mult / subtract pair. */
if (use_code == MINUS_EXPR && !negate_p
&& gimple_assign_rhs1 (use_stmt) == result
- && optab_handler (fms_optab, TYPE_MODE (type)) == CODE_FOR_nothing
- && optab_handler (fnma_optab, TYPE_MODE (type)) != CODE_FOR_nothing)
+ && !direct_internal_fn_supported_p (IFN_FMS, type, opt_type)
+ && direct_internal_fn_supported_p (IFN_FNMA, type, opt_type))
{
tree rhs2 = gimple_assign_rhs2 (use_stmt);
case REALIGN_LOAD_EXPR:
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
- case FMA_EXPR:
{
get_expr_operands (fn, stmt, &TREE_OPERAND (expr, 0), flags);
get_expr_operands (fn, stmt, &TREE_OPERAND (expr, 1), flags);
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
case DOT_PROD_EXPR:
- case FMA_EXPR:
return true;
default:
flags &= ~OEP_ADDRESS_OF;
break;
- case FMA_EXPR:
case WIDEN_MULT_PLUS_EXPR:
case WIDEN_MULT_MINUS_EXPR:
{
by the second argument. */
DEFTREECODE (WIDEN_LSHIFT_EXPR, "widen_lshift_expr", tcc_binary, 2)
-/* Fused multiply-add.
- All operands and the result are of the same type. No intermediate
- rounding is performed after multiplying operand one with operand two
- before adding operand three. */
-DEFTREECODE (FMA_EXPR, "fma_expr", tcc_expression, 3)
-
/* Widening vector multiplication.
The two operands are vectors with N elements of size S. Multiplying the
elements of the two vectors will result in N products of size 2*S.